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(57) Abstract: The present invention relates to methods and compositions for modifying peptides, polypeptides and protein! 
polymers, especially glyco-mimetic polymers, so as to improve their biological activity or pharmacokinetic properties. The in 
further provides methods and uses foi such polymer-modified peptides, polypeptides and proteins. The invention is particularly 
suitable for use in the synthesis of polymer-modified synthetic bioactive proteins (Figure ID), and of pharmaceutical compositions 
that contain such proteins. 



WO 02/20033 Al HI 



(81) Designated States (national): AL, AM, AT, AU, AZ, BA, 
BB, BG, BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, 
FT, GB, GE, GH, HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG SI, SK, SL, TJ, TM, 
TR, TT, UA, UG, US, UZ, VN, YU, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ. MD, RU, T.I, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published: 

— -with international search report 

— before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments 

— with sequence listing part of description published sepa- 
rately in electronic form and available upon request from 
the International Bureau 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and A bbreviations " appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



WO 02/20033 



PCT/US01/21930 



Title of the Invention 

Polymer-Modified Synthetic Proteins 

Field of the Invention 

The present invention relates to methods and compositions for modifying 
5 biologically active synthetic peptides, polypeptides and proteins with polymers, 
especially glyco-mimetic polymers so as to improve their biological activity or 
pharmacokinetic properties. The invention further provides methods and uses for 
such polymer-modified peptides, polypeptides and proteins. 

Cross-Reference to Related Applications 

10 This application is a continuation in part of US Patent Applications Serial 

Nos. 60/231,330 (filed September 8, 2000) and 60/236,377 (filed September 29, 
2000), both of which applications are herein incorporated by reference in their 
entirety. 

Background of the Invention 

15 Over the past 30 years, medical attention has increasingly turned to the 

possibility of using proteins as therapeutic drugs for the treatment of disease (See, 
e.g., "Therapeutic Proteins 1999," Datamonitor pic, London, 1999). Most protein 
therapeutics are based on naturally occurring animal or human forms of the protein 
(such as insulin, growth hormone, erythropoietin, interleukin-2 etc.) and produced in 

20 recombinant DNA engineered bacterial or eukaryotic cells. Unfortunately, 
recombinant proteins made in bacterial cells, such as E. coli, often lack post- 
translational modifications that are normally found in the protein's naturally 
produced form. This lack of modification can have a significant negative impact on 
the pharmaceutical properties of the protein when introduced to a patient. 

25 Glycosylation is one example of a common post-translational modification in 

eukaryotes that involves the enzymatic attachment of complex sugar chains to the 
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protein. Because recombinant proteins made in bacterial expression systems lack the 
ability to glycosylate proteins, eukaryotic expression systems such as plant, yeast, 
insect or mammalian cells have been used in their place when a glycosylated form or 
'glycoprotein' is required. A significant problem in making recombinant 
5 glycoproteins is that the resulting product typically consists of a complex mixture of 
different 'glycoforms', which may have widely varying physiological or 
pharmacological properties. 

Nevertheless, recombinant DNA techniques have become the primary 
method for commercial production of many polypeptides and proteins because of the 

10 large quantities that can be produced in bacteria and other host cells. Recombinant 
protein production involves transfecting or transforming host cells with DNA 
encoding the desired exogenous protein and growing the cells under conditions 
favoring expression of the recombinant protein. E. coli and yeast are favored as hosts 
because they can be made to produce recombinant proteins at high titers (see, U.S. 

15 Patent 5,756,672). 

Numerous U.S. Patents have been issued with respect to general bacterial 
expression of recombinant-DNA-encoded proteins (see, for example, U.S. Patents 
4,565,785; 4,673,641; 4,738,921; 4,795,706; 4,710,473). Unfortunately, the use of 
recombinant DNA techniques has not been universally successful. Under some 

20 conditions, certain heterologous proteins expressed in large quantities from bacterial 
hosts are precipitated within the cells in dense aggregates, recognized as bright spots 
visible within the enclosure of the cells under a phase-contrast microscope. These 
aggregates of precipitated proteins are referred to as "refractile bodies," and can 
constitute a significant portion of the total cell protein (Brems et al., Biochemistry 

25 (1985) 24: 7662. 

Recovery of protein from these bodies has presented numerous problems, 
such as how to separate the protein encased within the cell from the cellular material 
and proteins harboring it, and how to recover the inclusion body protein in 
biologically active form. For general review articles on refractile bodies, see 

30 Marston, supra; Mitraki and King, Bio/Technology, 7:690 (1989); Marston and 
Hartley, Methods in Enzymol., 182: 264-276 (1990); Wetzel, "Protein Aggregation 
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In Vivo: Bacterial Inclusion Bodies and Mammalian Amyloid," in Stability of 
Protein Pharmaceuticals: In Vivo Pathways of Degradation and Strategies for Protein 
Stabilization, Ahern and Manning (eds.) (Plenum Press, 1991); and Wetzel, 
"Enhanced Folding and Stabilization of Proteins by Suppression of Aggregation In 
5 Vitro and In Vivo," in Protein Engineering--A Practical Approach, Rees, A. R. et al. 
(eds.) (IRL Press at Oxford University Press, Oxford, 1991). 

Proteins also have been produced as research reagents by total chemical 
synthesis (See, e.g., Wilkin et al., Curr. Opinion Biotech. (1999) 9:412-426). This 
process involves chemical synthesis of proteins by the stepwise coupling of 

10 individual amino acids, or convergent strategies that employ the separate synthesis 
of peptide or polypeptide segments followed by linking of the segments, including 
chemical ligation of such segments, to generate full-length products (See, e.g., 
Dawson et al., Ann. Rev. Biochem (2000) 5P:923-960). In some instances chemical 
synthesis has been utilized to make small glycoproteins modified with simple low 

15 molecular weight sugars (Shin et al., J. Am. Chem. Soc. (1999) 121:1 1684-1 1689). 
In other cases, proteins have been made chemically to contain detectable labels and 
the like (Bolin et al., FEBS Letters (1999) 45/:125-131). Unfortunately, such 
labeled proteins offer little to no therapeutic advantage over their recombinantly 
produced counterparts. However, a few proteins showing therapeutic potential have 

20 been made by chemical synthesis with a focus on improving potency by use of small 
molecule-type changes to pharmacophore regions of the target protein (U.S. Patent 
6,168,784). While potency is an important aspect in making a therapeutic, other 
factors remain problems with protein therapeutics. 

Efforts to develop therapeutically usable protein drugs have long suffered 

25 from the undesirable bioactivity, bioavailability and biokinetics (such as clearing 
time, etc.) of putative drug candidates upon in vivo administration. The principal 
factors that have severely limited the use of proteins, and of polypeptides in 
particular, as therapeutic agents has been the fact that these compounds often elicit 
an immunogenic response in the circulatory system (see, U.S. Patent 4,179,337 

30 (Davis et al.)). This effect has one or both of two secondary consequences. The first 
being the destruction of polypeptides by the elicited antibodies, and the second more 
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seriously, being the emergence of an allergic response. For example, the antibody- 
mediated destruction of insulin is believed to be responsible for the rather low 
residence time of insulin in the human circulatory system. In the case of enzymes, 
not only is there a problem of destruction of the polypeptide and the subsequent 
5 negation of its physiological activity but also the most undesired elicitation of an 
allergic reaction. Conversely, certain proteins, such as clotting factors, growth 
factors and cytokines, etc., may exhibit a biological half-life that is 
disadvantageously prolonged upon their in vivo administration. 

One approach to improving proteins for use as therapeutics has involved 

10 derivatizing the protein with water-soluble polymers. Such conjugation has been 
proposed as a means for improving the circulating life, water solubility or 
antigenicity of administered proteins, in vivo (see, U.S. Patent 4,179,337 (Davis et 
al.). Many different water-soluble polymers and attachment chemistries have been 
used towards this goal, such as polyethylene glycol, copolymers of ethylene 

15 glycol/propylene glycol, carboxymethylcellulose, dextran, polyvinyl alcohol, 
polyvinyl pyrrolidone, poly-1, 3-dioxolane, poly-l,3,6-trioxane, ethylene/maleic 
anhydride copolymer, polyaminoacids (either homopolymers or random copolymers) 
and the like. Polyethylene glycol ("PEG") is one such chemical moiety that has been 
used in efforts to obtain therapeutically usable proteins (Zalipsky, S. {Bioconjugate 

20 Chemistry (1995) 6:150-165; Mehvar, R. (J. Pharm. Pharmaceut. Sci. (2000) 
3(1): 125-136)). Its backbone (CH2CH20)„ is flexible, amphiphilic, not susceptible 
to proteases, and non-immunogenic. Attachment of PEG also has been shown to 
protect proteins against proteolysis (Blomhoff, H.K. et al., Biochim Biophys Acta 
(1983) 757:202-208). It also can improve other physical-chemical properties of 

25 proteins. 

For example, pegademase bovine (Adagen®), a. formulation of adenosine 
deaminase with an attached PEG polymer, has been developed for treating severe 
combined immunodeficiency disease (SCID); superoxide dismutase with an attached 
PEG moiety has been in clinical trials for treating head injury; alpha interferon with 
30 an attached PEG moiety has been tested for treating hepatitis. Pegylated EL-6 has 
been described (see, EP 0 442 724, and U.S. Patent 5,264,209). EP 0 154 316 
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reports reacting a lymphokine with an aldehyde of polyethylene glycol. European 
Patent Publication EP 0 401 384, describes materials and methods for preparing 
granulocyte colony stimulating factor ("G-CSF") to which polyethylene glycol 
molecules are attached. Modified G-CSF and analogs thereof are also reported in EP 

5 0 473 268, stating the use of various G-CSF and derivatives covalently conjugated to 
a water-soluble particle polymer, such as polyethylene glycol. U.S. Patent 5,880,255 
summarizes many of the proteins that have been modified with PEG chains. 

A variety of means have been used to attach polymer moieties such as PEG 
and related polymers to reactive groups found on the protein (see, U.S. Patent 

10 4,179,337 (Davis et al.), and U.S. Patent 4,002,531 (Royer). For a review, see 
Abuchowski et al., in "Enzymes as Drugs," (J. S. Holcerberg and J. Roberts, eds. 
pp. 367-383 (1981) and Zalipsky, S. (Bioconjugate Chemistry (1995) 6:150-165. 
The use of PEG and other polymers to modify proteins also is discussed by Cheng, 
T.-L. et al., Bioconjugate Chem. (1999) 10:520-528; Belcheva, N. et al., 

15 Bioconjugate Chem. (1999) 10: 932-937; Bettinger, T. et al., Bioconjugate Chem. 

(1998) 9:842-846; Huang, S.-Y. et al., Bioconjugate Chem. (1998) 9:612-617; Xu, 
B. et al. Langmuir (1997) 13:2447-2456; Schwarz, J. B. et al., J. Amer. Chem. Soc. 

(1999) 121:2662-2673; Reuter, J. D. et al., Bioconjugate Chem. (1999) 10:271-278; 
Chan, T.-H. et al., J. Org. Chem. (1997) 62:3500-3504. Typical attachment sites in 

20 proteins include primary amino groups, such as those on lysine residues or at the N- 
terminus, thiol groups, such as those on cysteine side-chains, and carboxyl groups, 
such as those on glutamate or aspartate residues or at the C-terminus. Some of the 
most common sites of attachment are to the sugar residues of glycoproteins, 
cysteines or to the N-terminus and lysines of the target proteins. 

25 Although many different approaches have been described for conjugation of 

polymers to proteins, the conjugation process is not without complications. Care 
must be taken to limit the loss of biological activity caused by the conjugation 
reaction. For example, folded or re-folded proteins are typically used to minimize 
the number of sites of attachment. However, if too much of the activated polymer is 

30 attached to the target protein or polypeptide, biological activity can be severely 
reduced or lost. Likewise, if the wrong linker joining the polymer to the protein is 
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used or an insufficient amount of polymer is attached to the target, the therapeutic 
value of the resultant conjugate may be limited, and may not demonstrate enough of 
an increase in circulating life to compensate for the loss in its bioactivity. Problems 
can also result due to a blockage of the protein's active site by the modifying 
5 polymer. This problem can be difficult to avoid since the polymer and protein are 
typically joined in solution-based reactions. Pre-blocking the active sites with 
materials such as pyridoxal phosphate has been suggested, but the results have been 
inconsistent (see, U.S. Patent 4,179,337 (Davis et al.)). These problems are 
particularly acute with lower molecular weight proteins and peptides, which often 
10 have few attachment sites not associated with bioactivity. 

For instance, a common technique has been the attachment of water-soluble 
polymers to the primary amines in the target protein (e.g., the N-terminal amino 
group and the epsilon amino groups of lysines). Thiol-reactive polymer conjugates 
also have been used to attach polymers to the thiol side-chains of cysteines. Both 
15 approaches represent significant problems as most proteins contain multiple copies 
of such reactive groups spread out over various regions of the polypeptide backbone 
that play an important role in defining the activity, folding, re-folding and stability of 
a protein. Thus although widely used, such approaches suffer from more or less 
incomplete or unwanted reduction in bioactivity of a protein, and usually complex 
20 mixtures that are difficult to separate and characterize (Delgada et al., 
Pharmaceutical Sciences (1997) 3:59-66). 

In an attempt to reduce random attachment, proteins have been made in 
which the natural lysines are removed in conjunction with adding lysines at the 
desired sites of polymer attachment (U.S. Pat. No 4,904,584). For example G-CSF 
25 and IL-2 have been modified in this manner. Other attempts to avoid random 
attachment has been to make proteins in which the natural cysteines are removed in 
conjunction with adding cysteines at the desired sites of polymer attachment, or 
cysteines are added without removing the natural cysteines if present (EP 0 668 
353). For example, G-CSF, IL-3 and EPO have been made this way. While such 
30 cysteine or lysine variants theoretically allow site-specific polymer attachment, there 
still is no guarantee that all selected sites can be modified in a controlled manner. 
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Given the inability to site-specifically modify proteins containing multiple 
amino acids with side-chains bearing the same or similar reactive functional groups, 
recent efforts have focused on the modification of the amino or carboxy terminus of 
proteins. Modification of the amino or carboxy terminus has relied on the ability of 
5 some chemical conjugation techniques to uniquely modify these sites (WO 90/02136 
and WO 90/02135). For example, this technique was utilized for the attachment of 
PEG chains to the N-terminal residue of G-CSF and the chemokine IL-8 (Gaertner et 
al, Bioconjug. Chem. (1996) 7(l):38-44; and WO 96/41813). However, 
modification of the N- or C-termini typically reduces a protein's activity (See, e.g., 
10 U.S. Patent 5,985,265 discussing attachment of PEG to the N-terminus and lysine 
side chains of G-CSF). Despite the drawbacks with modification of proteins with 
water-soluble polymers, PEGylation and attachment of other water-soluble polymers 
to proteins continues to be pursued. For example, attachment of PEG to histidine in 
IF-alpha has been described in U.S. Patent Nos. 5,951,974 and 6,042,822. 
15 Attachment of PEG to lysines in alpha interferon is described in U.S. Patent No. 
5,595,732. Attachment of PEG to sugar chains of erythropoietin (EPO) and internal 
amino acids such as lysines (EP 0 605 963 and WO 00/32772), and the N-terminus 
(U.S. Patent 6,077,939 and WO 00/32772) also has been described. 

Another problem is that all of the above-discussed polymers are highly non- 
20 homogeneous, which render analytical characterization and purification of the 
mixtures of polymer-modified proteins difficult (Delgada et al., Pharmaceutical 
Sciences (1997) 3:59-66). For example, techniques used to prepare PEG or PEG- 
based chains, even those of fairly low relative molecular mass such as 3400, involve 
a poorly controlled polymerization step which leads to preparations having a spread 
25 of chain lengths about a mean value; that is, they involve polymer preparations of 
(CH2CH20) n where n does not have a discrete value but rather has a range of values 
about a mean. The resulting heterogeneity of the derivatized proteins is often 
associated with a range of properties that one cannot easily identify much less 
separate. Although the use of polypeptide adducts might be considered to offer a 
30 possible solution in principle, the use of polypeptides is disadvantageous since they 



WO 02/20033 



PCT7US01/21930 



-8- 

are susceptible to proteolytic cleavage and can render the derivatized protein 
immunogenic. 

Unfortunately, however, the problem of identifying and employing a suitable 
polymer is exacerbated by the fact that all proteins currently employed for 
5 therapeutic use are derived from recombinant DNA technologies. The use of 
recombinant DNA technologies puts severe limitations on the types and specificity 
of linkages that can be formed between a performance-enhancing moiety and the 
recombinantly produced protein. This is because firstly there are only a very limited 
number of functional groups suitable for linkage, and secondly there will generally 
1 0 be several copies of the reactive functional group in the protein being modified, thus 
precluding any specificity of modification. For example, it has been shown that in 
the case of nonselective conjugation of superoxide dismutase with PEG, several 
fractions of the modified enzyme were completely inactive (P. McGoff et al., Chem. 
Pharm. Bull. (1988) 36:3079-3091). Also, if differing numbers of such moieties are 
15 randomly attached, the pharmacokinetics of the therapeutic protein cannot be 
precisely predictable, making dosing a large problem. The lack of control of 
attachment furthermore may lead to (a) reduced potency, (b) a need for elaborate 
purification schemes to separate a vast mixture of derivatives and (c) possibly 
unstable attachment of the modifying moiety. Several linkages, such as 
20 tresylchloride- based linkages, known to the art are also known to be immunogenic. 

Thus to improve circulating half-life, reduce proteolysis and immunogenicity 
and improve other properties of biologically produced proteins, water-soluble 
polymers such as PEG can be attached, but with mixed results given the difficulty of 
attaching them in a controlled manner and with user-defined precision. Also, 
25 because of the limited success in attaching polymers at precise user-defined sites, 
very little is known about preferred sites of attachment that could be applicable to 
proteins in general. In addition, because of the stochastic nature of attachment, and 
the hetero-disperse nature of PEG and other water-soluble polymers currently 
employed for such purposes, purification and analytical characterization of PEG- 
30 protein conjugates has been difficult. Thus the combined problem of poor control 
over reproducible attachment and polymer heterogeneity has severely hampered the 
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routine approval of polymer-modified proteins as therapeutics (only a few approved 
to date for therapeutic use despite its introduction in the early 1970's). 

Accordingly, a need exists for methods of forming bioactive proteins that are 
distinct from recombinant DNA technologies, and that could be used to form 

5 proteins capable of polymer modification. A need also exists for a preferred 
polymer that can be employed to form derivatized proteins that have polymer 
adducts of defined structure rather than a mixture of chains of different lengths. 
Polymers that can be employed to form derivatized proteins having polymer adducts 
with structures that can be tailored to mimic desirable properties of natural proteins 

10 also is needed. Moreover, a need exists for derivatizing proteins that has general 
applicability to many proteins. The present invention satisfies these and other needs. 

SUMMARY OF THE INVENTION 

The present invention concerns polymer-modified synthetic bioactive 
proteins, pharmaceutical compositions that contain them and methods for their 

15 production and use. The polymer-modified synthetic bioactive proteins of the 
invention are produced in whole or in part by chemical synthesis, and thus differ 
markedly from biologically produced proteins. 

In detail, the invention provides polymer-modified synthetic bioactive 
proteins having a monomer molecular weight greater than about 25 kilo Daltons 

20 ("kDa"). The invention further provides polymer-modified synthetic bioactive 
proteins having one or more irregular amino acid residues. The invention 
particularly provides polymer-modified synthetic bioactive proteins that possess a 
bioactivity that mimics a bioactivity associated with a ribosomally specified protein 
such as a natural mammalian protein (e.g., human, simian, bovine, murine, porcine, 

25 ovine, or equine, avian, or piscine protein). The invention more particularly 
provides such polymer-modified synthetic bioactive proteins having a bioactivity 
that mimics a bioactivity associated with a ribosomally specified protein selected 
from the group consisting of a protein receptor or fragment thereof, a protein 
receptor ligand or fragment thereof, and a cytokine. 
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In preferred embodiment, the invention is directed to a molecularly 
homogenous polymer-modified synthetic bioactive protein of the formula: 
Protein-Uw-sl-B-s2-Polymer-s3-J* 

where Protein comprises a polypeptide chain of a ribosomally specified 
5 protein, where the polypeptide chain comprises one or more non-overlapping peptide 
segments covalently bonded by one or more chemical ligation sites, U is a residue of 
a unique functional group covalently bonded to a mutually reactive unique functional 
group of a side chain n of one or more amino acids of one or more of the non- 
overlapping peptide segments, where n is a discrete integer from 1 to 6, B is a 
10 branching core having three or more arms that may be the same or different and may 
be present or absent, Polymer is a substantially non-antigenic water-soluble polymer 
that may be the same or different where B is present, J* is a residue of pendant group 
having a net charge under physiological conditions selected from the group 
consisting of negative, positive and neutral, and where si, s2, and s3 are spacer or 
1 5 linker moieties that may be the same or different, and may be individually present or 
absent. A preferred molecularly homogenous polymer-modified synthetic bioactive 
protein of the invention is one that is mono-disperse having a monomer molecular 
weight of greater than 25 kDa. 

In another embodiment, the invention is directed to a polymer-modified 
20 synthetic bioactive protein having a polypeptide chain comprising an amino acid 
sequence of a ribosomally specified glycoprotein, where the polypeptide chain has 
one or more water-soluble polymers attached thereto. In a more preferred 
embodiment, one or more of the water-soluble polymers is covalently attached at one 
or more sites of the polypeptide chain that correspond to a glycosylation site of the 
25 ribosomally specified glycoprotein. Preferred water-soluble polymers are 
polyalkylene oxide, polyamide alkylene oxide and derivatives thereof that are glyco- 
mimetic water-soluble polymers. The most preferred are polymer-modified 
synthetic proteins comprising a polypeptide chain of a cytokine glycoprotein. 

The invention further concerns pharmaceutical compositions comprising 
30 such polymer-modified synthetic bioactive proteins of the invention. The invention 
further provides a method of treating a human disease or condition that comprises 
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administering to an individual in need of such treatment an effective amount of a 
pharmaceutical composition comprising one or more of such pharmaceutical 
compositions of the invention. 

The invention also is directed to methods of producing polymer-modified 
5 synthetic bioactive proteins of the invention. A preferred method for producing the 
synthetic bioactive proteins of the invention comprises chemically ligating peptide 
segments comprising non-overlapping amino acid sequences of a polypeptide chain 
of the polymer-modified synthetic protein, where one or more of the peptide 
segments used for ligation has a water-soluble polymer attached thereto at a user- 
10 defined and pre-selected site. The polymer-modified polypeptide chain may then be 
folded to produce a polymer-modified synthetic bioactive protein of the invention. . 

Another preferred method for producing the synthetic bioactive proteins of 
the invention comprises chemically ligating peptide segments comprising non- 
overlapping amino acid sequences of a polypeptide chain of a synthetic bioactive 
15 polymer-modified protein of the invention, and attaching one or more water-soluble 
polymers to a side-chain of an amino acid at one or more chemical ligation sites 
thereof. The polymer-modified polypeptide chain may then be folded to produce a 
polymer-modified synthetic bioactive protein of the invention. 

The invention also is drawn to a method of producing a polymer-modified 
20 synthetic bioactive protein comprising (1) chemically ligating peptide segments 
comprising non-overlapping amino acid sequences of a polypeptide chain of a 
synthetic bioactive protein to form a full-length polypeptide chain corresponding to 
the synthetic bioactive protein, where at least one peptide segment comprises an 
irregular amino acid having a first chemoselective functional group, (2) folding the 
25 polypeptide chain, and (3) attaching a water-soluble polymer thereto that comprises 
a second chemoselective group that is uniquely and mutually reactive with the first 
chemoselective group. 

Another embodiment of the invention is directed to molecularly 
homogeneous glyco-mimetic water-soluble polymers of the formula U-sl-B-s2- 
30 Polymer-s3-J*, where U is a residue of a unique functional group, B is a branching 
core having three or more arms that may be the same or different and may be present 
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or absent, Polymer is a polyamide having a molecular weight greater than about 
5,000 Daltons of the formula -[C(0)-X-C(0)-NH-Y-NH]n- or -[NH-Y-NH-C(O)- 
X-C(0)]n-, where X and Y are divalent radicals that may be the same or different 
and may be branched or linear, and n is a discrete integer from 2 to 50, and where 

5 either or both of X and Y comprises a substantially non-antigenic water-soluble 
repeat unit that may be linear or branched, J* is a residue of a substantially non- 
antigenic pendant group having a net charge under physiological conditions selected 
from the group consisting of negative, positive and neutral, and where si, s2, and s3 
are spacer or linker moieties that may be the same or different, and may be 

10 individually present or absent. 

The invention further concerns a molecularly homogeneous water-soluble 
polymer of the formula: 

U-sl-B-s2-PoIymer-s3-J* 

where U is a residue of a unique functional group, B is a branching core having three 

15 or more arms that may be the same or different and may be present or absent, 
Polymer is a polyamide having a molecular weight greater than about 5,000 Da of 
the formula -[C(0)-X-C(0)-NH-Y-NH]n- or -[NH-Y-NH-C(0)-X-C(0)]n-, where 
X and Y are divalent radicals that may be the same or different and may be branched 
or linear, and n is a discrete integer from 2 to 50, and where either or both of X and 

20 Y comprises a substantially non-antigenic water-soluble repeat unit that may be 
linear or branched, J* is a residue of a substantially non-antigenic pendant group 
having a net charge under physiological conditions selected from the group 
consisting of negative, positive and neutral, and where si, s2, and s3 are spacer or 
linker moieties that may be the same or different, and may be individually present or 

25 absent, and if present preferably comprise from 1 to 18 carbons. 

The invention particularly concerns such molecularly homogeneous water- 
soluble polymers wherein U is a residue of a functional group selected from the 
group consisting of acrylate, aldehyde, ketone, aminooxy, amine, carboxylic acid, 
ester, thioester, halogen, thiol, cyanoacetate, dipalmitoyl phosphatidylethanolamine, 

30 distearoyl phosphatidylethanolamine, epoxide, hydrazide, azide, isocyanate, 
maleimide, methacrylate, nitrophenyl carbonate, orthopyridyl disulfide, silane, 
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sulfhydryl, vinyl sulfones, succinimidyl glutarate, succimidyl succinate, succinic 
acid, tresylate and an activatable functional group. 

The invention further concerns such molecularly homogeneous water-soluble 
polymers wherein U is a residue of a functional group capable of forming a bond 

5 selected from the group carbonate, ester, urethane, orthoester, amide, amine, oxime, 
imide, urea, thiourea, thioether, thiourethane, thioester, ether, thaizolidine, 
hydrazone, and oxazolidine, and particularly such polymers wherein U is protected. 

The invention further concerns such molecularly homogeneous water-soluble 
polymers, wherein the polymer has a molecular weight greater than about 10,000 Da, 

1 0 and more preferably greater than about 15,000 Da. 

The invention further concerns such molecularly homogeneous water-soluble 
polymers wherein B comprises four or more arms, and/or wherein B comprises a 
branching core joined to s2 or Polymer through one or more amide bonds, and/or 
wherein B comprises a branching core joined to Polymer through amide bonds. 

15 The invention further concerns such molecularly homogeneous water-soluble 

polymers wherein the repeat unit comprises an ethylene oxide of the formula -(CH 2 - 
CH 2 -0)- or -(CH2-CH2-O)-; and/or wherein X, Y, or X and Y are selected from: - 
((C^m-CCHj-CHz^na-C^ni-)- or -((CH 2 ) nl -(0-CH2-CH 2 )n2-(CH 2 ) nl -), where 
nl is a discrete integer from 1 to 6, n2 is a discrete integer from 2 to 50; and/or 

20 wherein one of X or Y is selected from the group consisting of phenyl, a C1-C1 0 
alkylene moiety, a Cj-Cio alkyl group, a heteroatom-containing phenyl, a 
heteroatom-containing C1-C10 alkylene moiety, a heteroatom-containing Ci-Cifj 
alkyl group, and a combination thereof. 

The invention particularly concerns such molecularly homogeneous water- 

25 soluble polymers wherein X is -(CH 2 -CH 2 )- , or -X'-NH- or -NH-X' -. 

The invention particularly concerns such molecularly homogeneous water- 
soluble polymers wherein Y is -(CH 2 -(CH2-CH2-0)3-CH2-CH 2 -CH 2 )n- or -(CH 2 - 
CH 2 -CH 2 -(0-CH 2 -CH 2 )3-CH 2 )-n , where n is 6 to 36; or -Y'-NH- or -NH-Y% 
especially wherein at least one of X' and Y' comprise a polyethylene oxide of 

30 formula (-CH2-CH 2 -0-)n or (0-CH 2 -CH 2 -)n where n is 2 to 50.. 
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The invention particularly concerns such molecularly homogeneous water- 
soluble polymers wherein J* is protected, or comprises (i) a non-ionizable group 
having a net neutral charge under physiological conditions, or (ii) an ionizable group 
that comprises a net positive charge under physiological conditions, or (iii) an 
5 ionizable group that comprises a net negative charge under physiological conditions. 

The invention particularly concerns such molecularly homogeneous water- 
soluble polymers wherein the water-soluble polymer is mono-disperse. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A - IE depict schematics of processes for preparing the polymer- 
1 0 modified synthetic bioactive proteins of the invention. 

Figures 2A - 2C depict schematics of processes for preparing synthetic 
bioactive proteins of the invention. 

Figures 3 A - 3B depict schematics of processes for multi-segment ligations 
that involve the chemical ligation of three or more non-overlapping peptide 
15 segments, i.e., at least one segment is a middle segment corresponding to the final 
full-length ligation product. 

Figures 4A - 4C illustrate native chemical ligation and chemical 
modification of the resulting side-chain thiol. 

Figures 5A - 5B depict solid phase process for generating the branching core 
20 (B) and unique chemoselective functional group (U) of the water-soluble polymer U- 
B-Polymer-J* of the invention. 

Figures 6A - 6D depict a solid phase process for generating preferred 
substantially non-antigenic water-soluble polyamide Polymer-J* components of the 
invention for subsequent attachment to the U-B core. 
25 Figure 7 depicts process for coupling the U-B component to Polymer-J* 

component to generate the preferred synthetic polymer constructs of the invention of 
the formula U-B-Polymer-J*. 
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Figure 8 depicts an alternative route for precision attachment of a water- 
soluble polymer to a peptide segment employable for ligation and production of 
bioactive synthetic proteins of the invention. 

Figure 9 illustrates the overall process of preparing the synthetic bioactive 
5 proteins of the present invention. 

Figure 10 depicts the basic structure of a preferred type of synthetic 
erythropoiesis stimulating proteins. pPEG = "precision PEG" 

Figure 11 shows the general structure of circularly permuted SEP analogs 
having a relocated amino and carboxy terminus. 
10 Figure 12 shows the basic structure of synthetic bioactive GCSF proteins. In 

the figure, "J" designates a non-naturally encoded residue having a hydrophobic side 
chain. 

Figure 13 shows the structure of preferred synthetic RANTES analogs. In 
the Figure, NNY = nonanoyl, X = non-naturally encoded amino acid having a 
1 5 hydrophobic side chain, and FA = fatty acid. 

Figure 14 shows a structure of a preferred water-soluble polymer, and 
various linear and branched constructs thereof. 

Figure 15 depicts schematically the formation of branched-chain pPEG 
polymers. 

20 Figure 16 depicts the synthesis of a synthetic cytokine designated SEP1-L30, 

which is a precision polymer-modified synthetic analog of human EPO prepared as 
described in Example 2. 

Figure 17 depicts a synthetic cytokine designated SEP1-L30, which is a 
precision polymer-modified synthetic analog of human EPO prepared as described in 

25 Example 2. 

Figure 18 depicts a synthetic cytokine designated SEP1-L26, which is a 
precision polymer-modified synthetic analog of human EPO prepared as described in 
Example 3. 

Figure 19 depicts schematically the formation of a preferred branched-chain 
30 water-soluble polymer that is attached to a synthetic cytokine designated SEP1-B50, 
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which is a precision polymer-modified synthetic analog of human EPO prepared as 
described in Example 4. 

Figure 20 depicts a synthetic cytokine designated SEP1-B50, which is a 
precision polymer-modified synthetic analog of human EPO prepared as described in 
5 Example 4. 

Figure 21 depicts the synthesis of a synthetic cytokine designated SEP3-L42, 
which is a precision polymer-modified synthetic analog of human EPO prepared as 
described in Example 5. 

Figure 22 depicts synthetic cytokine designated SEP3-L42, which is a 
10 precision polymer-modified synthetic analog of human EPO prepared as described in 
Example 5 . 

Figure 23 depicts the synthesis of a preferred water-soluble polymer for 
attaching to a synthetic cytokine designated SEP1-B51, which is a precision 
polymer-modified synthetic analog of human EPO prepared as described in Example 
15 7. 

Figure 24 depicts a synthetic cytokine designated SEP1-B51, which is a 
precision polymer-modified synthetic analog of human EPO prepared as described in 
Example 7. 

Figure 25 depicts the synthesis of a preferred water-soluble polymer for 
20 attaching to a synthetic cytokine designated SEP1-B52, which is a precision 
polymer-modified synthetic analog of human EPO prepared as described in Example 
8. 

Figure 26 depicts a synthetic cytokine designated SEP1-B52, which is a 
precision polymer-modified synthetic analog of human EPO prepared as described in 
25 Example 8. 

Figure 27 shows representative analytical data for the precision polymer- 
modified synthetic analogs of human EPO prepared as described in Examples 2-5 
and 7-8. As shown, a representative Isoelectric Focusing Gel (IEF) and non- 
reducing SDS-PAGE gel demonstrate the relative monomer molecular weight of the 
30 folded, purified SEP1-B5 1 . 
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Figure 28 shows the in vitro activity in a factor-dependent cell line of SEP 
compounds SEPO, SEP1-L26, SEP1-L30, SEP1-B50, SEP1-B51 and SEP3-L42. 

Figure 29 shows a representative pharmacokinetic profile comparing plasma 
concentration in nanograms per milliliter (ng/ml) of SEP3-L42 and SEP1-B50 
5 versus time in hours. 

Figure 30 shows linear regression analyses of the in vivo activity as 
measured by the 72-hour red blood cell (RBC) - 59 Fe uptake (as a % of dose) for SEP 
1-B51 and recombinant glycosylated human erythropoietin produced in CHO-cells 
("rhEPO") in a hypoxic rat model. 
10 Figure 31 shows a representative pharmacokinetic profile for clearance in 

rats comparing plasma concentration in ng/ml of SEP1-B51 and rhEPO versus time 
in hours following a single IV dose of 5 p.g/kg for each compound. 

Figure 32 depicts a synthetic chemokine designated RANTES Gl 755-01, 
which is a precision polymer-modified synthetic analog of RANTES prepared as 
1 5 described in Example 1 8. 

Figure 33 depicts a synthetic chemokine designated RANTES G1755, which 
is a precision polymer-modified synthetic analog of RANTES prepared as described 
in Example 19. 

Figure 34 depicts a synthetic chemokine designated RANTES G1805, which 
20 is a precision polymer-modified synthetic analog of RANTES prepared as described 
in Example 20. 

Figure 35 depicts a synthetic chemokine designated RANTES G1806, which 
is a precision polymer-modified synthetic analog of RANTES prepared as described 
in Example 21, 

25 Figure 36 shows a representative analytical data for the synthetic chemokine 

analogs prepared in Examples 18-21. As shown, a representative SDS-PAGE gel 
compares the relative molecular weights of wild type (Wt) RANTES to synthetic 
chemokine analog RANTES G1806 under reducing (R) and non-reducing conditions 
(N). 
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Figure 37 shows a pharmacokinetic profile for clearance in rats comparing 
plasma concentration in picograms per milliliter (pg/ml) of synthetic Rantes analogs 
G1755, G1806, and G1805 versus time in minutes following a single IV dose of 
Equimolar amounts/kilo gram (kg) weight of each animal for each compound. 
5 Figure 38 shows a circularly-permuted, polymer-modified SEP construct. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The invention relates to polymer-modified synthetic bioactive proteins, 
methods of production and use. The invention also relates to glyco-mimetic water- 
soluble polyamide polymers. 

1 0 I. Polymer-Modified Synthetic Bioactive Proteins of The Invention 

Although polymer-modified proteins have been previously described, the 
nature and manner of the possible polymer-modifications of the present invention 
markedly differ from such prior efforts. For instance, the polymer-modified 
synthetic bioactive proteins of the present invention are chemically synthesized, in 

15 whole. In addition, the polymer-modified synthetic bioactive proteins of the 
invention are modified with water-soluble polymers at one or more particular, user- 
selected and user-defined sites. 

Moreover, the synthetic production of the proteins in the present invention 
permits one to ensure that such modifications are present at each of the user-selected 

20 and user-defined sites of every molecule in a preparation. Such uniformity and 
control of synthesis markedly distinguishes the present invention from the random 
modifications permitted to the use of the methods of the prior art. Significantly, the 
present invention permits one to design a synthetic protein in which any non-critical 
residue may be derivatized to contain a polymer adduct. Moreover, for each such 

25 user-selected and user-defined site, the user may define the precise linkage (amide, 
thioester, thioether, oxime, etc.) through which such adducts will be bonded to the 
polypeptide or protein backbone. Additionally, the particular polymer adducts 
desired to be present at a particular site may be varied and controlled, such that a 
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final preparation is obtained in which every protein or polypeptide present contains 
precisely the same derivatized adducts at precisely the same derivatized sites. Thus, 
the present invention permits the formation of homogeneous preparations of 
polymer-modified polypeptides and proteins. 

5 In addition to providing a means for varying the position and number of 

attachment sites to which a water-soluble polymer can be bound, the present 
invention permits one to vary the nature of the bound polymer. The polymer adducts 
that may be incorporated into any particular user-selected and user-defined site can 
be of any defined length. Likewise, consistent with the methods of present 

10 invention, it is possible to employ polymers of different lengths at different sites. 
Thus, in one embodiment, the . polymer-modified synthetic bioactive proteins of the 
present invention may be either mono-modified, or poly-modified, with a water- 
soluble polymer adduct. Where more than one polymer adduct is introduced into a 
particular polypeptide or protein, the employed polymers may be "mono-speciated," 

15 "poly-speciated," "uniformly speciated," or to "diversely speciated in." As used 
herein, the term "mono-speciated" is intended to refer to a polypeptide or protein 
that has been modified by a single species of polymer. In contrast, the term "poly- 
speciated" is intended to refer to a polypeptide or protein that has been modified by 
more than a single polymer species. Such poly-speciated polypeptides or proteins 

20 are said to be "uniformly-speciated" if, at each modified site of the polypeptide or 
protein, the same, single species of polymer is present. In contrast, a poly-speciated 
polypeptide or protein is said to be "diversely-speciated" if, the modified sites of the 
polypeptide or protein are modified with different species of polymer. 

Moreover, it is possible to vary the extent of linearity or branchedness of the 

25 polymer adduct at each of the user-selected in user-defined sites. Thus the polymer 
adducts may be linear, branched, or uniformly branched. The term "uniformly 
branched" is intended to mean that all branches of a polymer at a particular site have 
the same structure and length. As we appreciate, the present invention permits one 
to independently vary both the length of any individual branch, as well as the 

30 structure of the polymer present at such branch point. 
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In sum, the present invention permits one to define (1) the location and (2) 
frequency of polymer-modified, user-selected and user-defined sites in a polypeptide 
or protein backbone, as well as to control the (3) length, (4) species, and (5) degree 
of branching present at each such site. Additionally, with respect to polypeptide and 

5 proteins having multiple polymer modifications, the present invention permits one to 
independently define each of the above-identified five variables for each site. 
Moreover, with respect polypeptides and proteins having branched polymer 
modifications, the present invention permits one to independently define each of the 
above-identified five variables for each branch point. This allows for the synthesis 

10 of homogenous compositions of precision polymer-modified synthetic bioactive 
proteins having multiple copies of any or all of the 20 genetically encoded amino 
acid side chains present and left unencumbered by unwanted polymer attachment. 

Thus the invention affords significant flexibility in the design, synthesis and 
use of polymer-modified synthetic bioactive proteins, particularly those that target 

15 receptors for cytokines, growth factors and the like. For instance, the amino acid 
sequence of the synthetic bioactive proteins of the invention may comprise one or 
more deletions, insertions or substitutions relative to the amino acid sequence of a 
genetically encoded protein on which it is based. The amino acid sequence can be 
substituted with one or more irregular amino acids, such as a pseudo amino acid and 

20 or other amino acid bearing an unnatural side chain the like, for example, one that is 
modified to bear a unique chemoselective functional group for attaching a water- 
soluble polymer adduct thereto. Thus, the water-soluble polymer can be attached 
through an irregular amino acid, or a genetically encoded amino acid. The water- 
soluble polymer can be linear or branched, and it can have a terminal group 

25 comprising a chemical moiety such as a carboxylic acid, aliphatic, amide or amine, 
Moreover, the water-soluble polymer can be monodisperse, i.e., it can be made to 
comprise a single molecular species of precisely defined structure and composition. 
Thus, the water-soluble polymer can be designed to contain features to precisely tune 
half-life, immunogenicity, potency, storage stability, dosage, deliverability and the 

30 like of the target protein modified therewith. 
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In a preferred embodiment, the invention is directed to molecularly 
homogenous polymer-modified synthetic bioactive proteins. These proteins have the 
formula: 

Protein-Un-sl-B-s2-Polymer-s3-J* 

5 In this formula, Protein comprises a polypeptide chain of a ribosomally 

specified protein. The polypeptide chain comprises one or more non-overlapping 
peptide segments covalently bonded by one or more chemical ligation sites. 

By "ribosomally specified protein" is intended a protein produced 
ribosomally in a cell. In a preferred embodiment, the polymer-modified polypeptide 

10 chain possess a bioactivity that mimics a bioactivity associated with such 
ribosomally specified protein such as a natural mammalian protein (e.g., human, 
simian, bovine, murine, porcine, ovine, or equine, avian, or piscine protein). More 
preferable is a bioactivity that mimics a bioactivity associated with a ribosomally 
specified protein selected from the group consisting of a protein receptor or fragment 

15 thereof, a protein receptor ligand or fragment thereof, and a cytokine. Bioactive 
polypeptide chains of ribosomally specified proteins include novel and well as those 
obtainable from various gene or protein databases. Examples of such databases 
include, but are not limited to, GeneBank (Benson, et al., Nucleic Acids Res (1998) 
26(1): 1-7; USA National Center for Biotechnology Information, National Library of 

20 Medicine, National Institutes of Health, Bethesda, MD, USA), TIGR Database (The 
Institute for Genomic Research, Rockville, MD, USA), Protein Data Bank 
(Brookhaven National Laboratory, USA), and the ExPASy and Swiss-Protein 
database (Swiss Institute of Bio informatics, Geneve, Switzerland). The most 
preferred ribosomally specified proteins are cytokines, of which polypeptide chains 

25 and their specific amino acid sequences are well known (See e.g., "The Cytokine 
Handbook", 3 rd Edition, Ed. A. Thomas, Associated Press, 1998; "Cytokines", Ed. 
A. Mire-Sluis & R. Thorpe, Academic Press, 1998; and "Cytokine Reference", Vol. 
1, Ligands, A compendium of cytokines and other mediators of host defense, Eds. 
J.J. Oppendheim and M. Feldmann, Acedemic Press, 2001)). 

30 By "chemical ligation site" is intended the N-terminal amino acid of a first 

peptide or polypeptide and the C-terminal amino acid of a second peptide or 
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polypeptide that form or are capable of forming a non-reversible covalent bond 
therein between by chemical ligation. As used herein, "chemical ligation" refers to a 
chemoselective reaction involving the covalent joining of two chemical moieties, 
each of which moieties bears a mutually reactive functional group that is uniquely 

5 capable of forming a non-reversible covalent bond with the other. Chemical ligation 
includes covalent ligation of (1) a first peptide or polypeptide bearing a uniquely 
reactive C-terminal group with (2) a second peptide or polypeptide bearing a 
uniquely reactive N-terminal group, where the C-terminal and N-terminal reactive 
groups form a non-reversible covalent bond therein between. In particular, chemical' 

10 ligation includes any chemoselective reaction chemistry that can be applied to 
ligation of unprotected peptide segments. 

U is a residue of a unique functional group covalently bonded to a mutually 
reactive unique functional group of a side chain n of one or more amino acids of one 
or mote of the non-overlapping peptide segments, and where n is a discrete integer 

15 from 1 to 6. Most preferably side chain n is a discrete integer from 1 to 4. As used 
herein the term "amino acid" is intended to include the 20 genetically coded amino 
acids, rare or unusual amino acids that are found in nature, and any of the non- 
naturally occurring amino acids, such as irregular amino acids; sometimes referred to 
as amino acid residues when in the context of a peptide, polypeptide or protein. 

20 Preferred embodiments of U and side chain n are covalent bonds formed from 
unique mutually reactive groups, where such bonds are selected from oxime, amide, 
amine, urethane, ether, thioether, ester, hydrazide, oxazolidine, thaizolidine, 
thioether, ether, and ester. The most preferred U and n bond is one where U is 
covalently bonded to side chain n through a bond formed by chemical ligation 

25 selected from the group consisting of amide, oxime, thioester, hydrazone, 
thaizolidine, oxazolidine. 

B is a branching core having three or more arms that may be the same or 
different and may be present or absent. Most preferably B is present and comprises 
three or more arms, and even more preferably four or more arms. In particular, one 

30 arm of B is joined to U (optionally through a spacer or linker si), and a second arm 
of B is joined to Polymer (optionally through a spacer or linker s2). A favored 
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polymer-modified synthetic bioactive protein is one in which at least one of the 
branching arms of moiety B comprises a residue of bond selected from the group 
consisting of oxime, amide, amine, urethane, thioether, ester, hydrazide, oxazolidine, 
and thaizolidine. A preferred branching group B of a polymer-modified synthetic 
5 bioactive protein of the invention comprises a branching core selected from the 
group consisting of amino, carboxylate and mixed amino-carboxylate. Preferred 
amino branching core comprise lysine, preferred carboxylate branching core 
comprise glutamic or aspartic acid, and preferred mixed amino-carboxylate 
branching core comprises gamma-glutamic acid, or derivatives thereof. 

10 The Polymer component is a substantially non-antigenic water-soluble 

polymer that may be the same or different where B is present. By "water-soluble 
polymer" is intended a substantially non-antigenic polymer that is soluble in water 
and has an atomic molecular weight greater than about 1,000 Daltons. The Polymer 
will preferably have an effective hydrodynamic molecular weight of greater than 

15 10,000 Da, and more preferably about 20,000 to 500,000 Da, and most preferably 
about 40,000 to 300,000 Da. By "effective hydrodynamic molecular weight" is 
intended the effective water-solvated size of a polymer chain as determined by 
aqueous-based size exclusion chromatography (SEC). When the water-soluble 
polymer contains polymer chains having polyalkylene oxide repeat units, such as 

20 ethylene oxide repeat units, it is preferred that each chain have an atomic molecular 
weight of between about 200 and about 80,000 Da and preferably between about 
1,500 and about 42,000 Da, with 2,000 to about 20,000 Da being most preferred. 
Unless referred to specifically, molecular weight is intended to refer to atomic 
molecular weight. 

25 The Polymer component can have a wide range of molecular weight, and 

polymer subunits. These subunits may include a biological polymer, a synthetic 
polymer, or a combination thereof. Examples of such water-soluble polymers 
include: dextran and dextran derivatives, including dextran sulfate, P-amino cross 
linked dextrin, and carboxymethyl dextrin, cellulose and cellulose derivatives, 

30 including methylcellulose and carboxymethyl cellulose, starch and dextrines, and 
derivatives and hydroylactes of starch, polyalklyene glycol and derivatives thereof, 
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including polyethylene glycol, methoxypolyethylene glycol, polyethylene glycol 
homopolymers, polypropylene glycol homopolymers, copolymers of ethylene glycol 
with propylene glycol, wherein said homopolymers and copolymers are 
unsubstituted or substituted at one end with an alkyl group, heparin and fragments of 
5 heparin, polyvinyl alcohol and polyvinyl ethyl ethers, polyvinylpyrrolidone, 
aspartamide, and polyoxyethylated polyols, with the dextran and dextran derivatives, 
dextrine and dextrine derivatives. It will be appreciated that various derivatives of 
the specifically recited water-soluble polymers are also contemplated. 

Water-soluble polymers such as those described above are well known, 

10 particularly the polyalkylene oxide based polymers such as polyethylene glycol 
"PEG" (See. e.g., "Polyethylene glycol) Chemistry: Biotechnical and Biomedical 
Applications", J.M. Harris, Ed., Plenum Press, New York, NY (1992); and 
"Polyethylene glycol) Chemistry and Biological Applications", J.M. Harris and S. 
Zalipsky, Eds., ACS (1997); and International Patent Applications: WO 90/13540, 

15 WO 92/00748, WO 92/16555, WO 94/04193,WO 94/14758, WO 94/17039, WO 
94/18247, WO 94/28937, WO 95/11924, WO 96/00080, WO 96/23794, WO 
98/07713, WO 98/41562, WO 98/48837, WO 99/30727, WO 99/32134, WO 
99/33483, WO 99/53951, WO 01/26692, WO 95/13312, WO 96/21469, WO 
97/03106, WO 99/45964, and US Patents Nos. 4,179,337; 5;075,046; 5,089,261; 

20 5,100,992; 5,134,192; 5,166,309; 5,171,264; 5,213,891; 5,219,564; 5,275,838; 
5,281,698; 5,298,643; 5,312,808; 5,321,095; 5,324,844; 5,349,001; 5,352,756; 
5,405,877; 5,455027; 5,446,090; 5,470,829; 5,478,805; 5,567,422; 5,605,976; 
5,612,460; 5,614549; 5,618,528; 5,672,662; 5,637,749; 5,643,575; 5,650,388; 
5,681,567; 5,686,110; 5,730,990; 5,739,208; 5,756,593; 5,808,096; 5,824, 778; 

25 5,824,784; 5,840,900; 5,874,500; 5,880,131; 5,900,461; 5,902,588; 5,919,442; 
5,919,455; 5,932,462; 5,965,119; 5,965,566; 5,985,263; 5,990,237; 6,011,042; 
6,013,283; 6,077, 939; 6,113,906; 6,127355; 6,177,087; 6,180,095; 6,194,580; 
6,214,966). 

The more preferred Polymer component comprises a polyalkylene oxide, 
30 polyamide alkylene oxide, or derivatives thereof. An even more preferred Polymer 
component is a polyamide having a molecular weight greater than about 1,000 
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Daltons of the formula -[C(0)-X-C(0)-NH-Y-NH]n- or -[NH-Y-NH-C(0)-X- 
C(0)]n-, where X and Y are divalent radicals that may be the same or different and 
may be branched or linear, and n is a discrete integer from 2- 100, and more 
preferably from 2 to 50, and where either or both of X and Y comprises a 
5 biocompatible, substantially non-antigenic water-soluble repeat unit that may be 
linear or branched. The most preferred water-soluble repeat unit comprises an 
ethylene oxide of the formula -(CH 2 -CH 2 -0)- or -(CH 2 -CH 2 -0)-. The number of 
such water-soluble repeat units can vary significantly, but the more preferred number 
of such units is from 2 to 500, 2 to 400, 2 to 300, 2 to 200, 2 to 100, and most 

10 preferably 2 to 50. An example of a more preferred embodiment is where one or 
both of X and Y is selected from: -((CH 2 ) nl -(CH 2 -CH 2 -0) n2 -(CH 2 ) n i-)- or - 
((CH 2 )ni-(0-CH2-CH2)^-(CH 2 )ni-), where nl is 1 to 6, 1 to 5, 1 to 4 and most 
preferably 1 to 3, and where n2 is 2 to 50, 2 to 25, 2 to 15, 2 to 10, 2 to 8, and most 
preferably 2 to 5. An example of a highly preferred embodiment is where X is - 

15 (CH 2 -CH 2 )-, and where Y is -(CH 2 -(CH 2 -CH 2 -0) 3 -CH 2 -CH 2 -CH 2 )- or -(CH 2 -CH 2 - 
CH 2 -(0-CH 2 -CH 2 ) 3 -CH 2 )-. 

The Polymer component or one or more of the spacers or linkers, when 
present, may include polymer chains or units that are biostable or biodegradable. 
For example, Polymers with repeat linkages have varying degrees of stability under 

20 physiological conditions depending on bond lability. Polymers with such bonds can 
be categorized by their relative rates of hydrolysis under physiological conditions 
based on known hydrolysis rates of low molecular weight analogs, e.g., from less 
stable to more stable polycarbonates (-O-C(O)-O-) > polyesters (-C(O)-O-) > 
polyurethanes (-NH-C(O)-O-) > polyorthoesters (-0-C((OR)(R'))-0-) > polyamides 

25 (-C(O)-NH-). Similarly, the linkage systems attaching a water-soluble polymer to a 
target molecule may be biostable or biodegradable, e.g., from less stable to more 
stable carbonate (-OC(O)-O-) > ester (-C(O)-O-) > urethane (-NH-C(O)-O-) > 
orthoester (-0-C((OR)(R'))-0-) > amide (-C(0)-NH-). These bonds are provided by 
way of example, and are not intended to limit the types of bonds employable in the 

30 polymer chains or linkage systems of the water-soluble polymers of the invention. 
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Component J* is a residue of pendant group having a net charge under 
physiological conditions selected from the group consisting of negative, positive and 
neutral. This includes alkyl, aryl, heteroalkyl, heteroaryl, arylalkyl, acyl, alkoxy, 
alkenyl, alkynyl, amideo, amino, carbonyl groups and the like, that are substituted or 
5 unsubstituted, and as well as salts thereof. Neutral groups preferably are alkyl or 
alkoxy groups, and can include, but are not limited to moieties containing from 1 to 
18 carbons, and may be linear or branched. When provided as a charged group, J* 
comprise an ionizable functional group. Examples of functional groups include, but 
are not limited to, carboxylic acids, esters, amides, nitriles, thiols, and hydroxyls. 

10 Such J* groups may be a component of amino acids, nucleic acids, fatty acids, 
carbohydrates, and derivatives thereof, and moieties such as chitin, chitosan, 
heparin, heparan sulfate, chondroitin, chondroitin sulfate, dermatan and dermatan 
sulfate, cyclodextrin, dextran, hyaluronic acid, phospholipid, sialic acid and the like. 
J* preferably comprises an ionizable moiety selected from carboxyl, amino, thiol, 

15 hydroxyl, phosphoryl, guanidinium, imidazole and salts thereof. The most preferred 
is where J* comprises an ionizable carboxylate moiety and has a net negative charge 
under physiological conditions. 

The components si, s2, and s3 are spacer or linker moieties that may be the 
same or different, and may be individually present or absent. Preferred spacers or 

20 linkers include linear or branched moieties comprising one or more repeat units 
employed in a water-soluble polymer, diamino and or diacid units, natural or 
unnatural amino acids or derivatives thereof, as well as aliphatic moieties, including 
alkyl, aryl, heteroalkyl, heteroaryl, alkoxy, and the like, which preferably contain up 
to 18 carbon atoms or even an additional polymer chain. Most preferably the spacer 

25 or linker comprises a polymer chain. 

Alternatively, the above formula Protein-U«-sl-B-s2-Polymer-s3-J* can be 
represented as "Protein-Un-B-Polymer-J*" where the si, s2 and s3 groups may be 
present or absent. The more preferred Polymer component is where the water- 
soluble polymer U-sl-B-s2-Polymer-s3-J* is produced in total by stepwise synthesis. 

30 This means that these polymers will have a precise molecular weight and defined, 
structure. In contrast, normal polymer synthesis, which is a polymerization process, 
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results in a mixture in which chains are of differing lengths, and so there is a 
distribution of molecular weights and sizes that are difficult if not impossible to 
separate. The ability to control molecular purity is advantageous in that a synthetic 
protein can be constructed that has a water-soluble polymer attached thereto and that 
5 is monodisperse. This represents a significant advantage in that variable properties 
resulting from heterogeneous compounds can be avoided, and only those compounds 
with the most preferred properties can be prepared and isolated with relative ease. 

Alternatively, the above formula Protein-U«-sl-B-s2-Polymer-s3-J* can be 
represented as "Protein-Uw-B-Polymer-J*" where the si, s2 and s3 groups may be 

10 present or absent. 

More preferred compounds of the formula Protein-Uw-sl-B-s2-Polymer-s3- 
J* have a polypeptide chain comprising an amino acid sequence of a ribosomally 
specified protein, and more preferably where the polypeptide chain has one or more 
irregular amino acids. As used here, "irregular amino acid" refers to an amino acid 

15 having a non-genetically encoded side chain, a non-genetically encoded backbone, a 
non-genetically encoded substituted Net or aC(O) moiety, or a combination thereof, 
i.e., other than one of the ribosomally installed 20 genetically encoded amino acids. 
Examples of preferred irregular amino acids include amino acids having side chains 
bearing a unique functional group other than a genetically encoded functional group, 

20 as well as pseudo amino acids and various amino acid derivatives. In this regard, the 
present invention permits wide selectability and flexibility in the design and/or 
construction of synthetic bioactive proteins. Examples of non-ribosomally installed 
amino acids that may be used in accordance with a present invention include, but are 
not limited to: D-amino acids, 13-amino acids, pseudo-glutamate, y-aminobutyrate, 

25 ornithine, homocysteine, N-substituted amino acids (R. Simon et al., Proc. Natl. 
Acad. Sci. U.S.A. (1992) 89: 9367-71; WO 91/19735 (Bartlett et al.), U.S. Patent 
5,646,285 (Baindur), a-aminomethyleneoxy acetic acids (an amino acid-Gly 
dipeptide isostere), and a-aminooxy acids and other amino acid derivatives having 
non-genetically non-encoded side chain function groups etc. Peptide analogs 

30 containing thioamide, vinylogous amide, hydrazino, methyleneoxy, thiomethylene, 
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phosphonamides, oxyamide, hydroxyethylene, reduced amide and substituted 
reduced amide isosteres and B-sulfonamide(s) may be employed. By "pseudo amino 
acid" is intended an amino acid having an identical backbone structure and side- 
chain group as a genetically encoded amino acid, but differing in the atomic 

5 composition of the side chain atoms. 

In a preferred embodiment, synthetic bioactive proteins are provided that 
have a water-soluble polymer attached to an irregular amino acid of a polypeptide 
chain thereof. Most preferred irregular amino acids for attachment of a water- 
soluble polymer thereto employ chemoselective ligation chemistry that can be used 

10 in the presence of genetically encoded functional groups without reacting with them. 

In another embodiment, the invention is directed to a synthetic bioactive 
protein comprising a polypeptide chain comprising an amino acid sequence of a 
ribosomally specified glycoprotein, where the polypeptide chain has one or more 
water-soluble polymers attached thereto, and that preferably has a monomer 

15 molecular weight of greater than 25 kDa. This includes synthetic bioactive proteins 
that possess a bioactivity that mimics a bioactivity associated with the ribosomally 
specified glycoprotein. As used herein, "glycoprotein" refers to a protein containing 
a carbohydrate chain covalently linked to an amino acid side chain of the protein 
through a sugar residue, and is sometimes referred to as glycosylated protein. The 

20 carbohydrate of a glycoprotein may be in the form of a monosaccharide, 
disaccharide(s), oligosaccharide(s), polysaccharide(s), or their derivatives (e.g., 
sulfo- or phospho-substituted). Includes glycoproteins having more than one 
carbohydrate attached to the protein, and are exemplified by glycoproteins having re- 
linked and O-linked glycosylation. Includes glycosylated bioactive domains of 

25 natural or non-natural glycoproteins. It will be appreciated that glycoproteins that 
have been mutated to eliminate one or more glycosylation sites, or that are expressed 
in cells that do not glycosylate the protein (e.g., E. coli) are embodied in this 
definition, as the residual site for polymer modification will be positionally the 
same. 

30 Preferred polypeptide chains of a polymer-modified synthetic bioactive 

protein of the invention comprise an amino acid sequence of a ribosomally specified 
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mammalian glycoprotein, such as a human glycoprotein. More preferred 
polypeptide chains comprise an amino acid sequence of a ribosomally specified 
glycoprotein that is a cytokine, of which many polypeptide chains and their specific 
amino acid sequences and associated biological properties are well known (See e.g., 
5 "The Cytokine Handbook", 3 rd Edition, Ed. A. Thomas, Associated Press, 1998; 
"Cytokines", Ed. A. Mire-Sluis & R. Thorpe, Academic Press, 1998; and "Cytokine 
Reference", Vol. 1, Ligands, A compendium of cytokines and other mediators of 
host defense, Eds. J.J. Oppendheim and M. Feldmann, Acedemic Press, 2001)). In a 
preferred embodiment, the ribosomally specified glycoprotein comprises one or 

10 more glycosylation sites, and a water-soluble polymer is attached to the polypeptide 
chain of the synthetic bioactive protein at one or more sites corresponding to one or 
more the glycosylation sites of the ribosomally specified glycoprotein. In a more 
preferred embodiment, the water-soluble polymer is attached to the polypeptide 
chain exclusively at one or more sites corresponding to one or more such 

1 5 glycosylation sites. This aspect of the invention includes synthetic bioactive proteins 
where the ribosomally specified glycoprotein is a recombinantly produced 
glycoprotein, which can be a natural glycoprotein or a non-natural glycoprotein, the 
latter of which can further include one or more non-natural glycosylation sites. By 
"glycosylation site" is intended an amino acid sequence of a protein that encodes for 

20 the enzymatic attachment of an oligosaccharide (carbohydrate) chain to the side- 
chain of an amino acid residue of the amino acid sequence of the protein; 
exemplified by N-linked and O-linked glycosylation sites. It will be appreciated that 
glycoproteins that have been mutated to eliminate one or more of such glycosylation 
sites are embodied in the definition of 'glycosylation site', as the residual site for 

25 polymer modification will be positionally the same. By "naturally occurring 
glycosylation site" is intended a glycosylation site of a glycoprotein found in nature. 
By "non-naturally occurring glycosylation site" is intended a glycosylation site that 
has been engineered into a protein. For instance, recombinant human EPO and 
GCSF have been engineered in this manner (see, e.g., U.S. Patent Nos. U.S. Patent 

30 No. 5,856,298 and 5,218,092). 
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This preferred embodiment of the invention is based in part on the finding 
that when a synthetic protein having a polypeptide chain of a glycoprotein is 
modified at one or more glycosylation sites thereof with a water-soluble polymer, 
such as the polymers described herein, the water-soluble polymer can be utilized to 
5 take advantage of one or more of the biological effects attributed to a carbohydrate 
chain normally found at that position in the counterpart naturally occurring 
glycoprotein, glycoproteins that have been engineered to contain additional 
glycosylation sites, or non-glycoproteins that have been engineered to be 
glycosylated. Such biological effects include modulation of protease resistance, 

10 immunogenicity, receptor-specificity, specific activity, potency and 
pharmacokinetics. However, by eliminating the presence of a carbohydrate chain, 
and replacing it with preferred water-soluble polymers of the invention, significant 
benefits are obtained including the avoidance of enzymatic degradation and 
clearance such as when pendant sialic acid residues of the sugar chains are removed, 

15 instability, limited circulation half-life, distribution, poor handling properties etc. 
Significantly, its has been found that such polymer-modified synthetic bioactive 
proteins of the invention retain such advantageous biological properties in the 
substantive absence of loss of biological activity when compared to a non-modified 
counterpart protein, including increased bioactivity, even when the monomer 

20 molecular weight of the synthetic protein is greater than 25 lcDa. 

Thus, in another preferred embodiment, the invention is directed to a 
synthetic bioactive protein comprising a polypeptide chain comprising an amino acid 
sequence of a ribosomally specified glycoprotein, where the polypeptide chain has 
one or more water-soluble polymers attached thereto and a monomer molecular 

25 weight of greater than 25 kDa, and where the synthetic bioactive protein comprises a 
bioactivity that is equal to or better than a corresponding synthetic bioactive protein 
having a polypeptide chain that is devoid of said water-soluble polymer. The 
bioactivity can be in vitro, in vivo or both. In a preferred embodiment, the 
bioactivity is in vivo. In another preferred embodiment, the invention is directed to a 

30 synthetic bioactive protein comprising a polypeptide chain comprising an amino acid 
sequence of a ribosomally specified glycoprotein, where the polypeptide chain has 
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one or more water-soluble polymers attached thereto and a monomer molecular 
weight of greater than 25 kDa, and where the synthetic bioactive protein comprises a 
bioactivity that is equal to or better than the ribosomally specified glycoprotein. 
Here again, the bioactivity can be in vitro, in vivo or both. In a preferred 
5 embodiment, the bioactivity is in vivo. Most preferably, such polymer modified 
synthetic bioactive proteins will have a discrete number of water-soluble polymers 
attached thereto. 

A benefit of targeting the non-naturally occurring glycosylation sites of a 
protein is that a protein found in nature, whether a natural glycoprotein or not, can be 

10 modified to contain a glycosylation site at one or more sites or regions, such as those 
unprotected disordered or surface-exposed regions responsible for immunogenicity 
or protease sensitivity, using recombinant DNA techniques (e.g., site-directed 
mutagenesis or directed evolution). For instance, non-naturally occurring 
glycosylation sites are designed at potential immunogenic or protease sensitive sites 

15 of a target polypeptide sequence, and screened in a recombinant system for 
production (e.g., CHO cell and yeast cell expression systems etc.) and assayed for 
the relevant desired bioactivity. New sites found to be amenable to glycosylation are 
then utilized in the design and synthesis of synthetic bioactive proteins of the 
invention that have a water-soluble polymer attached exclusively one or more of 

20 those positions. 

In another embodiment, the invention is directed to a synthetic bioactive 
protein comprising a pseudo amino acid at a ligation site of the protein, and 
optionally, a water-soluble polymer attached to the protein. These compounds are 
produced by forming a ligation product having an unprotected side-chain functional 

25 group at the ligation site by ligating a first peptide or polypeptide segment having an 
N-terminal amino acid comprising a chemoselective reactive functional group, such 
as a cysteine, to a second peptide or polypeptide having a chemical ligation 
compatible C-terminal functional group, such as a ct-carboxy thioester, and 
chemically converting the unprotected side-chain functional group at the ligation site 

30 to a pseudo amino acid, such as carboxymethylation of the cysteine thiol side-chain 
to form a pseudo glutamate amino acid. Pseudo amino acids formed by conversion 
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of cysteines at native chemical ligation sites is referred to herein as "Pseudo Native 
Chemical Ligation," which is described in detail below. 

Also provided is a synthetic bioactive protein comprising a water-soluble 
polymer attached to a side-chain of an amino acid at a ligation site of the protein. 
5 These compounds are synthesized by forming a ligation product having an 
unprotected side-chain functional group at the ligation site by ligating a first peptide 
or polypeptide segment having an N-terminal amino acid comprising a 
chemoselective reactive functional group, such as a cysteine, to a second peptide or 
polypeptide having a ligation compatible C-terminal functional group, such as a cc- 

10 carboxy thioester, and attaching a water-soluble polymer to the unprotected side- 
chain functional group at the ligation site. 

Utilization of pseudo amino acid chemical ligation and the chemical ligation 
site polymer modification methods of the invention afford several advantages over 
the prior art, including the robust synthesis of synthetic bioactive proteins that are 

15 otherwise devoid of suitable ligation sites, expansion of the sites (and attachment 
chemistries) to which site-specific polymer modification can be exploited in a 
routine and cost-effective manner, as well as the synthesis of synthetic bioactive 
proteins of substantial molecular weight, among others. They also are particularly 
suited for high throughput analoging for fine-tuning of desired biological properties, 

20 including scanning individual or multiple sites for water-soluble polymer 
attachment. 

As noted above, the biological properties of the polymer-modified synthetic 
bioactive proteins of the invention can be modified by precisely adjusting the sites 
and linkage chemistries of polymer attachment in combination with the precision 

25 adjustment of the molecular weight, the polymer composition, the structure (e.g., 
linear versus branched, or mixtures thereof), and the pendant group (e.g., charged 
versus uncharged, or mixtures thereof) of the water-soluble polymer. In particular, 
in addition to increasing the molecular weight of a protein to improve half-life, and 
branching etc., the water-soluble polymer attached thereto can render the synthetic 

30 protein to have a precise charge, as measured by isoelectric point that is 
approximately equal to the charge of a corresponding biologically produced protein 
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on which the amino acid sequence of the synthetic bioactive protein is based. This 
has the advantage of mimicking the natural charge of a corresponding ribosomally 
specified protein. A preferred embodiment of the invention is thus directed to 
synthetic bioactive proteins that combine the above features, as well as the water- 
5 soluble polymers utilized therefor, particularly structurally defined polymer adducts 
that are capable of being attached at preselected positions. 

As also indicated above, the polymer-modified synthetic bioactive proteins of 
the present invention can have substantial total molecular weight, being greater than 
about 25 kDa. For the purposes of the present invention, such determinations of 

10 molecular weight are to be made by denaturing SDS polyacrylamide electrophoresis. 
The term "monomer molecular weight" is intended to refer to the molecular weight 
of a monomer synthetic protein, as distinguished from synthetic proteins that may 
possess multiple copies of a protein or polypeptide. The term "monomer 
polypeptide molecular weight" is intended to refer to the molecular weight of a 

15 monomer polypeptide, as distinguished from synthetic proteins that may possess 
polymers attached thereto and/or multiple copies of a protein or polypeptide. 

As used herein, a protein is said to be "synthetic" if non-recombinant 
technology has been employed to polymerize some, and most preferably all, of its 
amino acid residues. The term "non-recombinant technology" is intended to 

20 distinguish technologies that involve organic chemistry and other synthetic 
polymerization approaches from technologies involving the translation of RNA into 
protein, either in vivo or in vitro. Synthetic proteins include totally synthetic and 
semi-synthetic proteins. A totally synthetic protein is produced where all ligation 
components are man-made by chemical synthesis, i.e., ribosomal-free synthesis. A 

25 semi-synthetic protein is produced where at least part of a ligation component is 
made by biological synthesis, i.e., ribosomally in a cell or cell-free translation 
system, and another part is made by chemical synthesis. 

As used herein, a synthetic protein is said to be "bioactive" if it possesses a 
discernible bioactivity that is dependent upon the synthetic protein's structure or 

30 amino acid sequence, such the variation in such structure or sequence enhances, 
modifies, or attenuates the protein's bioactivity. Without limitation, such 
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"bioactivity" includes the capacity of the protein to mediate a catalytic, signaling or 
inducing reaction. As used herein, a protein is said to "mediate a catalytic reaction" 
by converting a substrate into a product without itself being consumed or 
permanently modified. A protein is said to "mediate a signaling or inducing 
5 reaction" if its presence causes an organism, or its tissues or cells, to initiate, 
continue, enhance, modify, or attenuate gene expression, cellular differentiation, or 
cellular proliferation. Examples on such signaling or inducing reactions include 
inducing cytokine expression or a response to cytokine expression, inducing 
erythropoiesis, inducing or attenuating inflammation and/or an inflammatory 
10 process, initiating angiogenesis or vascularization, inducing apoptosis, affecting the 
cell cycle, etc. 

Accordingly, by "polymer-modified synthetic bioactive protein" as used 
herein refers to a synthetic bioactive protein having one or more water-soluble 
polymers attached thereto. The bioactivity of the synthetic bioactive proteins of the 

1 5 present invention also includes the capacity of the protein to bind specifically to a 
receptor, ligand or antibody. As used herein, the term "specific" binding is intended 
to refer to a structurally-based binding interaction, such as that existing between a 
hormone and its receptor, or an antibody and an antigenic epitope, as distinguished 
from "non-specific" binding based upon charge, solubility, hydrophobicity, etc. 

20 The polymer-modified synthetic bioactive proteins of the present invention 

may be designed to possess a bioactivity that mimics a bioactivity associated with a 
mammalian (including human, simian, bovine, murine, porcine, ovine, equine, etc.), 
avian, or piscine protein. As used herein, a first protein is said to mimic a second 
protein if the first protein possesses a bioactivity that is modified, enhanced, or 

25 attenuated with respect to the bioactivity of the second protein. A bioactivity is said 
to be "associated" with the protein if its presence is directly or indirectly dependent 
upon the amino acid sequence of the protein. In an alternative embodiment, the 
bioactive proteins of the present invention may be wholly or partly engineered, 
without reference to any corresponding natural bioactive counterpart. 

30 Preferred synthetic bioactive proteins of the present invention include 

polymer-modified proteins other than receptors, and polymer-modified soluble 
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receptor domains. For instance, synthetic bioactive proteins comprising a polymer- 
modified soluble receptor domain are particularly preferred. Examples of soluble 
receptor domains that are preferred for polymer modification are described in 
PCT/US00/06297, and include soluble domains of adrenocorticotropic hormone 
5 receptor and its bioactive fragments, angiotensin receptor, atrial natriuretic receptor, 
bradykininin receptor, growth hormone receptor, chemotatic receptor, dynorphin 
receptor, endorphin receptor, the receptor for p-lipotropin and its bioactive 
fragments, enkephalin receptor, enzyme inhibitor receptors, the receptor for 
fibronectin and its bioactive fragments, gastrointestinal- and growth hormone- 
10 releasing peptide receptors, the receptor for luteinizing hormone releasing peptide, 
the receptor for melanocyte stimulating hormone, neurotensin receptor, opioid 
receptor, oxytocin receptor, vasopressin receptor, vasotocin receptor, the receptor for 
parathyroid hormone and fragments, protein kinase receptor, somatostatin receptor, 
substance P receptor. 

15 Polymer-modified synthetic cytokines are particularly preferred embodiments 

of the invention. In particular, the polymer-modified synthetic bioactive proteins of 
the invention include synthetic cytokines, including synthetic chemokines, which 
mediate specific effects on cell-cell interaction, communication and on the behavior 
of other cells in a biological setting. As used herein, the term "cytokine" is intended 

20 to include the interleukins ("IL") (such as IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL- 
9, IL-10, DL-11, IL-12, IL-13, IL14, etc.), lymphokines and signaling molecules such 
as erythropoiesis stimulating proteins (e.g., erythropoietin (EPO), thrombopoietin 
(TPO)), tumor necrosis factor (TNF), interferons, etc., growth factors, such as 
transforming growth factor, nerve growth factor, brain derived growth factor, 

25 neurotrophin-3, neurotrophin-4, heptaocyte growth factor, T-cell Growth Factor 
(TGF, TGF-pi, TGF-B2, TGF-p3, etc.), Colony Stimulating Factors (G-CSF, GM- 
CSF, M-CSF etc.), Epidermal Growth Factors (EGF, LIF, KGF, OSM, PDGF, IGF-I, 
etc.), Fibroblast Growth Factor (aFGF, (3FGF, etc.), and hormones, particularly 
signal peptide hormones, such as growth hormone, and chemokines. As used herein, 

30 the term "chemokine" is intended to refer to the chemotactic cytokines, such as 
6Ckine, 9E3, ATAC, ABCD-1, ACT-2, ALP, AMAC-1, AMCF-1, AMCF-2, AIF, 



WO 02/20033 



PCT7US01/21930 



-36- 

ANAP, Angie, beta-Rl, Beta-Thromboglobulin, BCA-1, BLC, blr-1 ligand, BRAK, 
CIO, CCF18, Ck-beta-6, Ck-beta-8, Ck-beta-8-1, Ck-beta-10, Ck-beta-11, cCAF, 
CEF-4, CINC, C7, CKA-3, CRG-2, CRG-10, CTAP-3, DC-CK1, ELC, Eotaxin, 
Eotaxin-2, Exodus-1, Exodus-2, ECIP-1, ENA-78, EDNAP, ENAP, FIC, FDNCF, 
5 FINAB, Fractalkine, G26, GDCF, GOS-19-1, GOS-19-2, GOS-19-3, GCF, GCP-2, 
GCP-2-like, GROl, GR02, GR03, GRO-alpha, GRO-beta, GRO-gamma, H400, 
HC-11, HC-14 , HC-21, HCC-1, HCC-2, HCC-3, HCC-4 H174 , Heparin 
neutralizing protein, Humig, 1-309, ILINCK, I-TAC , IfilO, DL8, IP-9, IP- 10, IRH, 
JE, KC, Lymphotactin, L2G25B, LAG-1, LARC, LCC-1, LD78-alpha, LD78-beta, 

10 LD78-gamma, LDCF, LEC, Lkn-1, LMC, LAI, LCF, LA-PF4, LDGF, LDNAP, LIF, 
LIX, LUCT, Lungkine, LYNAP, Manchester inhibitor, MARC, MCAF, MCP-1, 
MCP-2, MCP-3, MCP-4, MCP-5, MDC, MIP-1-alpha, MIP-1-beta, MIP-1-delta, 
MIP-1-gamma, MIP-3, MEP-3-alpha, MIP-3-beta, MIP-4, MIP-5, Monotactin-1, 
MPIF-1, MPIF-2, MRP-1, MRP-2, Ml 19, MDNAP, MDNCF, Megakaryocyte- 

15 stimulatory-factor, MGSA, Mig, MIP-2, mob-1, MOC, MONAP, NC28, NCC-1, 
NCC-2, NCC-3, NCC-4 N51, NAF, NAP-1, NAP-2, NAP-3, NAP-4, NAP S, NCF, 
NCP, Neurotactin, Oncostatin A, P16, P500, PARC, pAT464, pAT744, PBP, PBP- 
like, PBSF, PF4, PF4-like, PF4-ALT, PF4V1, PLF, PPBP, RANTES, SCM-l-alpha, 
SCI, SCY A26, SLC, SMC-CF, ST38, STCP-1, SDF-1 -alpha, SDF-l-beta, TARC, 

20 TCA-3, TCA-4, TDCF, TECK, TSG-8, TY5, TCF, TLSF-alpha, TLSF-beta, TPAR- 
l,andTSG-l. 

Such cytokines, and their chemokine subfamilies and their corresponding 
polypeptide chains, variants thereof and associated bioactivities are well know and 
publicly available through multiple sources (See e.g., "The Cytokine Handbook", 3 rd 
25 Edition, Ed. A. Thomas, Associated Press, 1998; "Cytokines", Ed. A. Mire-Sluis & 
R. Thorpe, Academic Press, 1998; and "Cytokine Reference", Vol. 1, Ligands, A 
compendium of cytokines and other mediators of host defense, Eds. J.J. 
Oppendheim and M. Feldmann, Acedemic Press, 2001)). 
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II. Production of the Polymer-Modified Synthetic Bioactive Proteins of the 
Invention 

The production of the polymer-modified synthetic bioactive proteins of the 
invention can be envisioned as having the following steps: design, peptide synthesis, 
5 peptide ligation, folding of the full-length ligation product to generate protein, and 
assessment of the protein's bioactivity. We have found that polymer modification 
can be performed at one or more of the peptide synthesis, ligation or on the folded 
product steps. It is preferred, however, to attach polymer to the peptides or to the 
ligation product prior to folding. Proteins produced in this manner that exhibit a 
10 desired bioactivity are selected and represent the synthetic bioactive proteins of the 
invention. 

In a particularly preferred embodiment, a method is provided for producing 
the synthetic bioactive proteins of the invention that comprises chemically ligating 
peptide segments comprising non-overlapping amino acid sequences of a 

15 polypeptide chain of a target synthetic bioactive polymer-modified protein of the 
invention, where one or more of the peptide segments used for ligation has a water- 
soluble polymer attached thereto at a user-defined and pre-selected site. The 
polymer-modified polypeptide chain may then be folded to produce a polymer- 
modified synthetic bioactive protein of the invention. 

20 Another preferred method for producing the synthetic bioactive proteins of 

the invention comprises chemically ligating peptide segments comprising non- 
overlapping amino acid sequences of a polypeptide chain of a synthetic bioactive 
polymer-modified protein of the invention, and attaching one or more water-soluble 
polymers to a side-chain of an amino acid at one or more chemical ligation sites 

25 thereof. The polymer-modified polypeptide chain may then be folded to produce a 
polymer-modified synthetic bioactive protein of the invention. 

The invention also is drawn to a method for producing a polymer-modified 
synthetic bioactive protein, the method comprising (1) chemically ligating peptide 
segments comprising non-overlapping amino acid sequences of a polypeptide chain 

30 of a synthetic bioactive protein to form a full-length polypeptide chain corresponding 
to the synthetic bioactive protein, where at least one peptide segment comprises an 
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irregular amino acid having a first chemoselective functional group, (2) folding the 
polypeptide chain, and (3) attaching a water-soluble polymer thereto that comprises 
a second chemoselective group that is uniquely and mutually reactive with the first 
chemoselective group. 

5 In particular, these various processes embodied in the invention, and as 

exemplified in the Examples, can be illustrated as shown in the Figures. For 
example, Figures 1A - IE depict ligation schemes involving the attachment of a 
water-soluble polymer (U-B-Polymer-J* as defined herein) to partially or fully 
unprotected peptide segments ( -wwvww^ ) before or after ligation, or 

1 0 combinations thereof. In Figures 1 A - ID, Y aa represents the C-terminal amino acid 
on a first peptide segment that bears a unique chemoselective moiety (e.g., amino 
acid bearing alpha-carboxyl thiocstcr) for chemical ligation to a second peptide 
segment bearing a unique and mutually reactive N-terminal amino acid X aa (e.g., 
amino terminal cysteine) moiety that is capable of chemoselective chemical ligation 

15 with Y aa . Chemoselective reaction between Y aa and X aa generate a covalent linkage 
therein between (e.g., amide bond). Thus Y aa and X aa form a chemoselective 
ligation pairing. U n - , as shown in the Figures, represents a second unique 
chemoselective moiety that has been incorporated at a precise user-defined site on 
the side chain of an amino acid and is chemoselective for, and mutually reactive with 

20 group U- of the water-soluble polymer U-B-Polymer-J*. For example, when U„ is a 
side chain that has been modified to bear a ketone group, U- of the water-soluble 
polymer U-Polymer-J* is a group chemoselective for reacting with the ketone, e.g., 
an aminooxy group which yields an oxime bond therein between. The subscript "n" 
of U n - represent the number of amino acids and their side chains designed to bear 

25 chemoselective group U, for example, where two specific and user-defined sites are 
to be polymer modified n = 2, which also can be represented as U2 or U n =2- In all 
cases, n is a positive integer that is precisely controlled by design. Thus U n and U 
represent a second chemoselective ligation pairing that is compatible and unreactive 
with chemoselective groups of X aa and Y aa . FigureslA and IB illustrate two 

30 different potential reactions. In the first depicted reaction, a polypeptide chain 
bearing a U n functionality is ligated to a second polypeptide, and is then reacted with 
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a U-B-Polymer-J* moiety in order to obtain a polymer-modified polypeptide. In the 
second depicted reaction, the polypeptide chain bearing the U n functionality is 
reacted with a U-B-Polymer-J* moiety in order to obtain a polymer-modified 
polypeptide, and then ligated to a second polypeptide, to obtain a larger polymer- 
5 modified polypeptide. The figures differ in that in FigurelA, the polypeptide 
bearing the Xaa residue is to receive the polymer modification, whereas in Figure 
IB, the polypeptide bearing the Yaa residue receives the polymer modification. 
Figure 1C illustrates the ability of the present invention to modify multiple 
polypeptide chains, either before or after their ligation to form a larger polypeptide. 

10 In Figure ID, PG and PG' represent protecting groups, where PG' depicts an 
orthogonal protecting group, i.e., PG and PG' are removable under different 
conditions, and are useful where different water-soluble polymers are attached via 
same chemistry to U n groups, or where U n groups represent side-chain functional 
groups that one does not wish to modify with a polymer (e.g., side chains bearing 

15 reactive -NH2 or -SH where U group of water-soluble polymer is designed to react 
exclusively with primary amino or side chain thiols). Figure ID shows that 
protecting groups can be employed in order to protect desired side chains of the 
polypeptides being ligated in accordance with the methods of the present invention. 
Figure IE illustrates the diversity of groups that may be present or absent in 

20 the peptide segments employed in ligation and polymer modification according to 
Figures 1A - ID. 

Figures 2A - 2C depict additional schematics of processes for preparing 
synthetic bioactive proteins of the invention. In particular, Figures 2A - 2B depict 
ligation scheme involving the attachment of a water-soluble polymer U-B-Polymer- 

25 J* to the side chain of amino terminal group Xaa at a ligation site (e.g., side chain 
thiol of cysteine). Figure 2C illustrates the diversity of groups that may be present 
or absent in the peptide segments employed in ligation and polymer modification 
according to Figures 2A - 2B. 

Figures 3A - 3B depict additional schematics of processes for 

30 accomplishing multi-segment ligations that involve the chemical ligation of three or 
more non-overlapping peptide segments, i.e., at least one segment is a middle 
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segment corresponding to the final full-length ligation product. Peptides prepared in 
this manner can be used for preparing peptide segments involved in another ligation 
reaction, for example, as shown in Figures 1A-1E, Figures 2A-2C, and Figures 
4A-4C. In general, for multi-segment ligations, the middle segment(s) has either a 
5 protected Xaa group or a protected Yaa group to avoid cyclization or concatomer 
formation of that peptide depending on the ligation chemistry employed. For 
sequential or serial ligations, the Xaa group of a middle segment is protected (e.g., 
Cys(Acm)) while the Yaa group is unprotected (e.g., Yaa-COSR, where -COSR is an 
alpha-carboxyl thioester). Here the Yaa group is free to react with a second peptide 

10 bearing an unprotected Xaa group, where the second peptide is devoid of a free Yaa 
group. Following ligation, the protecting group is removed to regenerate the Xaa 
group for the next ligation reaction. This process can be continued, as needed 
thereby generating an elongated polypeptide chain. Protection of the Yaa group is 
particularly useful for convergent chemical ligation involving the production of a 

15 final ligation product composed of four or more segments. For example, for a 
protein target generated from a four-segment ligation (i.e., three ligation reactions), 
two segments corresponding to one end of the protein and two segments 
corresponding to the other end of the protein can be ligated in parallel, as opposed to 
sequentially, and the two ends joined in a final ligation reaction. Such a convergent 

20 chemical ligation schemes also can employ orthogonal ligation chemistries. Here 
again the diversity of groups that may be present or absent in such peptide segments 
are illustrated in Figures IE, 2C and 4C. 

Figures 4A - 4C illustrate how native chemical ligation and chemical 
modification of the resulting side-chain thiol may be accomplished in accordance 

25 with the principles of the present invention. In particular, Figures 4A - 4B depict 
the use of native chemical ligation and chemical modification of the resulting 
cysteine side-chain thiol at the ligation site(s) to form a "pseudo amino acid" 
(depicted by v|/Xaa) via thioalkylation and generation of chemically modified side 
chain (depicted by comprising a thioether bond. In an alternative embodiment 

30 not depicted, the side chain thiol can be converted to an alanine in a desulfurization 
reaction (Liang et al, J. Amer. Chem. Soc. (2001) /2?(4):526-533). An significant 
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aspect for both reactions is that any other side-chain thiols that one does not wish to 
convert or modify be protected with a suitable protecting group (PG) or for multi- 
segment ligations and orthogonal protecting group (PG') where the segment bearing 
the carboxy terminal Yaa group comprises a protected amino terminal Xaa group, 
5 i.e., PG-Xaa-peptide, which is provided by way of example in Figure 4B. Figure 
4C illustrates the diversity of groups that may be present or absent in the peptide 
segments employed in ligation and polymer modification according to Figures 4A - 
4B, as well as Figures 1A-1E, Figures 2A-2C, and Figures 3A-3B. 

In designing the polymer-modified synthetic bioactive proteins of the 

10 invention, an initial polypeptide backbone amino acid sequence for a target protein is 
obtained, typically from any number of various sources including databases such as 
gene and/or protein databases, or obtained de novo for instance through proteomic 
identification of a novel protein target of interest. As noted above, examples of such 
databases include, but are not limited to, GeneBank (Benson, et al., Nucleic Acids 

15 Res (1998) 26(1): 1-7; USA National Center for Biotechnology Information, National 
Library of Medicine, National Institutes of Health, Bethesda, MD, USA), TIGR 
Database (The Institute for Genomic Research, Rockville, MD, USA), Protein Data 
Bank (Brookhaven National Laboratory, USA), and the ExPASy and Swiss-Protein 
database (Swiss Institute of Bioinformatics, Geneve, Switzerland). Publicly 

20 available patent databases also may be used for this purpose. The polypeptide 
backbone employed for the design can represent a portion (e.g., a domain) or all of a 
target sequence (e.g., a mature form of the polypeptide chain). 

With the initial polypeptide backbone sequence, the desired full-length 
polymer-modified polypeptide structure is designed for a given target synthetic 

25 protein or panel of analogs thereof. This involves design of the polypeptide 
backbone, structure of the water-soluble polymer(s), and the ligation and water- 
soluble polymer attachment strategy for a given synthetic protein construct of 
interest. In particular, the polypeptide amino acid sequence on which a design is 
based is reviewed for the presence of one or more glycosylation sites, one or more 

30 immunogenic sites, or one or more protease sensitive site (i.e., regions substantially 
devoid of helix or beta-sheet secondary structure; typically loop and turn regions). 
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Other sites can be gleaned as well, including aggregation sites and 
glycosaminoglycan (GAG) binding sites, the latter of which is common to all 
chemokines. Moreover, the three dimensional structures of many proteins are 
known and can be used to aid design. 

5 The glycosylation sites include N-linked and O-linked glycosylation sites, 

which typically occur at disordered regions found in loops and turns. The 
immunogenic sites include hydrophilic sites at turns that are on the surface of the 
protein. The protease sensitive sites of interest include amino acid sequences that 
are recognized by proteases and on the surface of the protein, typically disordered 

10 regions. Aggregation and GAG sites for many proteins and methods for their 
identification, particularly chemokines are well known (See e.g., "The Cytokine 
Handbook", 3 rd Edition, Ed. A. Thomas, Associated Press, 1998; "Cytokines", Ed. 
A. Mire-Sluis & R. Thorpe, Academic Press, 1998; and "Cytokine Reference", Vol. 
1, Ligands, A compendium of cytokines and other mediators of host defense, Eds. 

15 J.J. Oppendheim and M. Feldmann, Acedemic Press, 2001)). 

For instance, immunogenicity or protease stability of a given protein target 
can be reduced or eliminated by site-specific attachment of one or more polymers to 
the immunogenic epitope or protease recognition sequence to generate a protease 
resistant or non-immunogenic water-soluble polymer-modified bioactive protein of 

20 the invention. Other sites amenable to polymer attachment include the amino and/or 
carboxy terminus for a given protein target. 

Glycosylation sites, including natural as well as sites engineered to contain 
them are particularly preferred sites for attachment of water-soluble polymers. This 
is based on a finding of the present invention that synthetic bioactive proteins can be 

25 produced that are modified exclusively at one or more of such sites with water- 
soluble polymers so as to replace and mimic the positive biological effect of one or 
more of the carbohydrate chains of a naturally occurring glycoprotein at that 
position. 

Such water-soluble polymer attachment sites can be determined using by 
30 modeling and/or biological or chemical analysis approaches. Modeling includes 
bioinformatic approaches, such as the use of homology modeling software 
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algorithms or programs that compare portions up to all of a given nucleic acid or 
amino acid sequence of a target protein, two-dimensional structure information for a 
polypeptide of interest, and three-dimensional structure information for a protein of 
interest, or a combination thereof. Any number of software programs and databases 
5 are well known and available for this purpose. (See, e.g., GeneBank (Benson, et al., 
Nucleic Acids Res (1998) 26(1); 1-7; USA National Center for Biotechnology 
Information, National Library of Medicine, National Institutes of Health, Bethesda, 
MD, USA), TIGR Database (The Institute for Genomic Research, Rockville, MD, 
USA), Protein Data Bank (Brookhaven National Laboratory, USA), and the ExPASy 

10 and Swiss-Protein database (Swiss Institute of Bioinformatics, Geneve, 
Switzerland). An example of a software program that integrates these methods is 
"MAXVECTOR" (Oxford Molecular Group), which contains multiple protein 
analysis tools for predicting secondary structure (Chou-Fasman and Robson-Garnier 
methods); hydrophilicity (Hopp- Woods, Kyte-Doolittle, Goldman-Engelman- Steitz 

15 (GES), and von-Heijne methods); hydrophobicity (Fauchere-Pliska, Janin, 
Manavalan, Sweet-Eisenberg methods); antigenicity (Hopp-Woods, Parker, 
Protrusion Index (Thornton), Welling methods); surface probability (Karplus & 
Schultz's methods); Flexibility (Janin and Emini methods); amphiphilicity 
(Eisenberg method); PROSITE based subsequence database and user-defined 

20 subsequences; customized subsequence sets; and protease site and proteolytic 
digestion predictions. 

For example, when scanning a protein target for the presence of one or more 
glycosylation sites, part or all of the native amino acid sequence typically are 
reviewed for N-linked glycosylation sites (which typically occur at Asn-Xaa-Thr or 

25 Asn-Xotf-Ser sequences in loops or turns) and O-linked sites (which typically occur 
at Set-Xaa-Xaa-?ro or Thv-Xaa-Xaa-Vxo sequences in loops or turns). Various 
algorithms and modeling programs are suitable for this purpose (See, e.g., ExPASy 
and Swiss-Protein database (Swiss Institute of Bioinformatics, Geneve, Switzerland; 
Gupta et al., Glycobiology (1999) 9(10):1009-1022; "DICTYOGLYC", Hansen et 

30 al., Biochemical Journal (1995) 308:801-813; "O-GLYCBASE" Hansen et al., 
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Nucleic Acids Research (1997) 25:278-282; NETOGLYX" Hansen et al., 
Glycoconjugate J. (1998) 15:115-130). 

To predict immunogenic sites in proteins, the following approach can be 
used. A secondary structure prediction is run on the protein sequence. Chou- 
5 Fasman parameters are particularly useful for this purpose (Chou et al., Advanced 
Enzymology (1978) 47:45-148; .Maclin et al., "Refining algorithms with knowledge- 
based neural networks: Improving the Chou-Fasman algorithm for protein folding", 
Working Paper 91-2, Computer Sciences Dept., Univ. of Wisconsin - Madison, 
1991; and Qian et al., J. Molecular Biology (1988) 202:865-884; "PEPEPLOT" 

10 program from Genetics Computer Group, Inc.). Disordered regions, and particularly 
tarns are identified. The protein sequence also is scanned for hydrophobicity, 
typically using a running average of six residues following standard methods (e.g., 
Fauchere-Pliska or Hopp-Woods hydrophobicity prediction methods or Bull- 
Breese), and the more hydrophilic regions are identified with particular attention to 

15 those that are turns. In conjunction, glycosylation sites, including N-linked sites and 
O-linked sites are identified. The most hydrophilic site in the protein, if it is in a 
turn and not predicted to be glycosylated, will have a high probability of 
immunogenicity and therefor serve as a site for polymer modification. The same 
basic rules can be applied to successively less hydrophilic regions to identify 

20 secondary sites for polymer modification. 

Biological and chemical analysis also may be used to identify specific sites 
for polymer attachment in generating the synthetic bioactive proteins of the 
invention. For instance, an isolated glycoprotein that is to be used a basis of design 
and synthesis of a synthetic bioactive protein of the invention can be subjected to 

25 enzymatic digestion and analysis e.g. by HPLC peptide mapping and mass 
spectrometry, to experimentally identify sites of O- and N-linked glycosylation. For 
identifying immunogenic sites, or protease sensitive sites, more highly disordered 
'loop' regions on the surface of the protein molecule that contain such regions may be 
identified in a number of biological or chemical ways. For instance, an isolated 

30 protein that is to be used a basis of design and synthesis of a synthetic bioactive 
protein of the invention can be subjected to limited proteolytic digestion with a 
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variety of proteases, followed by mass spectrometric analysis of the resulting peptide 
fragments to map the sites of greatest susceptibility to proteases following standard 
protocols (See, e.g., Coligan JE, Dunn BM, Ploegh HL, Speicher DW, Wingfield 
PT, editors. Current Protocols in Protein Structure, vol II. New York: John Wiley & 
5 Sons, 1997; Chait, BT, Structure (1994) 2:465-467; Kriwacki et al., J Chromatogr A 
(1997) 777:23-30; Hillenkamp et al., Anal Chem (1991) 63A1 193-1201; Fenn et al., 
Mass Spectr Rev (1990) 9:37-70; and Siuzdak G. Mass Spectrometry for 
Biotechnology. San Diego: Academic Press, 1996). And there are many proteases 
commonly used for such purposes (Konigsberg WH, Methods Enzymol (1995) 

10 262:331-346; Carrey, EA, In: Creighton T, editor. Protein Structure, a Practical 
Approach. New York: IRL Press, (1989) pp:l 17-144; and Coligan, JE, Dunn BM, 
Ploegh HL, Speicher DW, Wingfield PT, editors. Current Protocols in Protein 
Structure, vol II. New York: John Wiley & Sons, 1997). Surface loop regions of the 
folded protein's polypeptide chain are usually associated with increased 

15 susceptibility to proteolytic digestion. A similar approach using limited 'random' 
chemical modification followed by digestion and peptide mapping by mass 
spectrometry can also be used to identify exposed surface regions of a protein of 
unknown three-dimensional structure. 

For identifying immunogenic sites, protease sensitive sites or other sites 

20 suitable for polymer modification, alanine scanning also can be performed to 
identify specific residues in the polypeptide backbone that are amenable as water- 
soluble polymer attachment sites. This is particularly useful for identifying invariant 
residues required for biological activity, and when a water-soluble polymer 
attachment site is positioned at an immunogenic or protease sensitive site of the 

25 target molecule. Alanine scanning is particularly useful when used in combination 
with other protein analysis tools, such as described above. Another approach is to 
systematically prepare a library of synthetic proteins of interest with the modifying 
water-soluble polymer moiety being located at a different site in the polypeptide 
chain of each molecule. After folding the library of polymer-modified polypeptide 

30 chains, a functional selection/assay can be performed to separate, for example, 
folded from non-folded molecules, or receptor-binding from non-binding molecules. 
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Then the location of the sites of polymer modification can be determined for the 
functional molecules. Such an approach can make use of the principles of the 
'protein signature analysis' method to facilitate identification of non-interfering sites 
of polymer modification (Muir et al., Chemistry & Biology (1996) 3:817-825). 
5 In conjunction with the design, the peptides or polypeptide segments utilized 

for synthesizing the polypeptide backbone are constructed. This involves selection 
of suitable ligation sites that are chosen based on the ligation chemistry selected for 
assembling the various polypeptide backbone segments, the polymer attachment 
chemistry chosen for a given target protein, and the particular polymer attachment 

10 sites. When native chemical ligation is employed, cysteine ligation sites are 
determined by scanning the target polypeptide backbone amino acid sequence for 
suitable naturally occurring cysteine residue. When Extended Native Chemical 
Ligation" is employed, as described herein, ligation sites can be selected by scanning 
the target polypeptide backbone amino acid sequence for suitable naturally occurring 

15 ligation site junctions that permit robust ligations, such as Xaa-G\y sites. Because 
extended native chemical ligation is not limited to ligation at cysteine residues, any 
number of residues may serve as the ligation site junction. In some instances, a 
combination of native and extended native chemical ligation may be part of the 
design. 

20 In a preferred embodiment, native chemical ligation is used to generate part 

or all of the full-length polypeptide chain. Cysteines present in the naturally 
occurring protein on which the synthetic bioactive protein in based can be used as 
the chemical ligation sites. However, where a preferred ligation junction is devoid 
of a suitable cysteine, the non-cysteine amino acid at that position can be replaced 

25 with a cysteine so as to permit native chemical ligation at that site. If desired, the 
newly introduced cysteine can be converted to a pseudo amino acid residue 
corresponding to the original amino acid at that position, as described herein. 
Alternatively, when the cysteine is introduced at a site for polymer modification, the 
side chain thiol can be exploited for the attachment of a thiol-reactive water-soluble 

30 polymer construct, provided that all other cysteines in the target polypeptide that one 
does not wish to modify are protected. 
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In another preferred embodiment, extended native chemical ligation, as 
described herein, can be utilized to generate part or all of the full-length polypeptide. 
For this method, N-terminal Net-substituted 2 or 3 carbon chain alkyl or aryl thiol 
amino acids may be employed. Such residues (where present at the N- terminus of a 
5 peptide or polypeptide segment used for ligation) can be advantageously used to 
ligate that polypeptide to a polypeptide having a C-terminal a-carboxy thioester 
moiety, in accordance with the methods of extended native chemical ligation 
described herein. 

Typically, the synthesis of peptides employs stepwise standard Boc and/or 
10 Fmoc solid phase peptide synthesis using standard automated peptide synthesizers, 
or manually following standard protocols, or ordered and purchased from 
commercial vendors. ("Synthetic Peptides, A User's Guide," G.A. Grant, Ed., W.H. 
Freeman & Company, New York, NY, 1992; "Priciples of Peptide Synthesis, 2nd 
ed.," M. Bodanszky, Ed., Springer- Verlag, 1993; "The Practice of Peptide Synthesis, 
15 2nd ed.," M. Bodanszky and A. Bodanszky, Eds., Springer-Verlag, 1994; and 
"Protecting Groups," P J. Kocienski, Ed., Georg Thieme Verlag, Stuttgart, Germany, 
1994; "Fmoc Solid Phase Peptide Synthesis, A Practical Approach, Eds. W.C. Chan 
and P.D. White, Oxford University Press, 2000). For peptides utilized for thioester- 
mediated ligation, such as for native chemical ligation, they can be made following 
20 standard protocols as well, (see, e.g., Dawson et al, Science (1994) 266:776-779; 
Canne et al. Tetrahedron Lett. (1995) 36:1217-1220; Kent, et al., WO 96/34878; 
Kent, et al, WO 98/28434; Ingenito et al., JACS (1999) 121(49):1 1369-11374; and 
Hackeng et al., Proc. Natl. Acad. Sci. U.S.A. (1999) 96:10068-10073); Amiato et al., 
supra.). 

25 For ligation and site-specific attachment of water-soluble polymers, 

chemically orthogonal strategies are employed in the synthesis of the peptides (See, 
e.g., Figures 1-4, and Examples) so as to avoid side reactions that result in unwanted 
attachment. For instance, depending on ligation design and the polymer attachment 
approach, a variety of orthogonal synthesis strategies can be exploited. In particular, 

30 the nature of the water-soluble polymer to be attached, and in particular the 
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functional group for joining it to the polypeptide are considered, for instance as 
discussed below for the preferred glyco-mimetic polymers of the invention. 

In particular, the water-soluble polymer is made to comprise a unique 
functional group U that is selectively reactive with a unique functional group on a 
5 target peptide employed for ligation, full-length material or even the folded 
polypeptide. As chemical synthesis is employed, the peptide is made to contain a 
mutually reactive chemoselective group at a precise, user-defined site. This aspect 
of the invention embodies the principles of peptide synthesis (protecting group 
strategies) and chemical ligation (partial or no protecting group strategies). For the 

10 protecting group strategy, all potentially reactive functional groups except for the 
desired functional group on the water-soluble polymer and its mutually reactive 
functional group present on the target molecule are blocked with suitable protecting 
groups. Many protecting groups are known and suitable for this purpose (See, e.g., 
"Protecting Groups in Organic Synthesis", 3rd Edition, T.W. Greene and P.G.M. 

15 Wuts, Eds., John Wiley & Sons, Inc., 1999; NovaBiochem Catalog 2000; "Synthetic 
Peptides, A User's Guide," G.A. Grant, Ed., W.H. Freeman & Company, New York, 
NY, 1992; "Advanced Chemtech Handbook of Combinatorial & Solid Phase Organic 
Chemistry," W.D.. Bennet, J.W. Christensen, L.K. Hamaker, M.L. Peterson, 
M.R.Rhodes, and H.H. Saneii, Eds., Advanced Chemtech, 1998; "Principles of 

20 Peptide Synthesis, 2nd ed.," M. Bodanszky, Ed., Springer-Verlag, 1993; "The 
Practice of Peptide Synthesis, 2nd ed.," M. Bodanszky and A. Bodanszky, Eds., 
Springer-Verlag, 1994; and "Protecting Groups," P.J. Kocienski, Ed., Georg Thieme 
Verlag, Stuttgart, Germany, 1994). 

Thus, the water-soluble polymer can represent be made to posses a wide 

25 range of functional groups, such as those described above. For the partial or no 
protecting group strategy, the functional group on the polymer and its mutually 
reactive functional group present on the target peptide or polypeptide employ a 
chemoselective reaction pair in which other functional groups may be present in the 
reaction system but are unreactive. This includes groups amenable to amine capture 

30 strategies (e.g., ligation by hemiaminal formation, by imine formation, and by 
Michael addition), thiol capture strategies (e.g., ligation by mercaptide formation, by 
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exchange involving cysteine or thiol contain side-chain amino acid derivative), and 
orthogonal ligation coupling strategies (e.g., ligation by thiazolidine formation, by 
thioester exchange, by thioester formation, by disulfide exchange, and by amide 
5 formation)(See, e.g., Coltart, DM., Tetrahedron (2000) 56:3449-3491). 

A preferred chemoselective U group for this embodiment will comprise a 
residue of a unique functional group employed in an aqueous compatible ligation 
chemistry such as native chemical ligation (Dawson, et al, Science (1994) 266:776- 
779; Kent, et al, WO 96/34878), extended general chemical ligation (Kent, et al, 

10 WO 98/28434), oxime-forming chemical ligation (Rose, et al, J. Amer. Chem. Soc. 
(1994) 116:30-33), thioester forming ligation (Schnolzer, et al, Science (1992) 
256:221-225), thioether forming ligation (Englebretsen, et al, Tet. Letts. (1995) 
36(48):8871-8874), hydrazone forming ligation (Gaertner, el al, Bioconj. Chem. 
(1994) 5(4):333-338), and thiazolidine forming ligation and oxazolidine forming 

15 ligation (Zhang, et al, Proc. Natl. Acad. Sci. (1998) 95(16):9184-9189; Tarn, et al, 
WO 95/00846) or by other methods (Yan, L.Z. and Dawson, P.E., "Synthesis of 
Peptides and Proteins without Cysteine Residues by Native Chemical Ligation 
Combined with Desulfurization," J. Am. Chem. Soc. 2001, 323, 526-533, herein 
incorporated by reference; Gieselnan et al, Org. Lett. 2001 3(9):133 1-1334; Saxon, 

20 E. et al, "Traceless" Staudinger Ligation for the Chemoselective Synthesis of 
Amide Bonds. Org. Lett. 2000, 2, 2141-2143). 

Given the various attachment chemistries described above, the bond formed 
between the water-soluble polymer and a target peptide or polypeptide can comprise 
a residue of a bond selected from carbonate, ester, urethane, orthoester, amide, 

25 amine, oxime, imide, urea, thiourea, thioether, thiourethane, thioester, ether, 
thaizolidine, hydrazone, oxazolidine and the like. The most preferred bonds are 
oxime and amide bonds. 

The peptide ligation step, for example as shown in Figures 1-4, may employ 
solid or solution phase ligation strategies. Figure 8 depicts another route for 

30 precision attachment of a water-soluble polymer to a peptide segment employable 
for ligation and production of polymer-modified synthetic bioactive proteins of the 
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invention. The first step employs solid phase peptide synthesis ("SPPS") (e.g., Fmoc 
or Boc SPPS), in which an amino acid side chain targeted for polymer attachment is 
protected with an orthogonal protecting group (e.g., if using Fmoc SPPS, a Boc 
group can be used to protect the site of polymer attachment, or if using Boc SPPS, an 
5 Fmoc group can be employed as the orthogonal protecting group). Following 
peptide synthesis, the orthogonal protecting group is selectively removed while the 
rest of the peptide remains protected. This affords a single attachment site for the 
next step - solid phase polymer synthesis. Once the orthogonal protecting group is 
removed, the polymer chain is attached as a precursor. More preferably, the polymer 

10 chain is built through successive rounds of polymer synthesis using a process 
depicted in Figures 6A - 6D. Although a single polymer attachment site is shown, 
more than one can be provided. 

As noted above, chemical ligation involves the formation of a selective 
covalent linkage between a first chemical component and a second chemical 

15 component. Unique, mutually reactive, functional groups present on the first and 
second components can be used to render the ligation reaction chemoselective. For 
example, the chemical ligation of peptides and polypeptides involves the 
chemoselective reaction of peptide or polypeptide segments bearing compatible 
unique, mutually reactive, C-terminal and N-terminal amino acid residues. Several 

20 different chemistries have been utilized for this purpose, examples of which include 
native chemical ligation (Dawson, et al., Science (1994) 266:776-779; Kent, et al., 
WO 96/34878; Kent, et al., WO 98/28434), oxime forming chemical ligation (Rose, 
et al., J. Amer. Chem. Soc. (1994) 116:30-34), thioester forming ligation (Schnolzer, 
et al., Science (1992) 256:221-225), thioether forming ligation (Englebretsen, et al., 

25 Tet. Letts. (1995) 36(48):8871-8874), hydrazone forming ligation (Gaertner, et al., 
Bioconj, Chem. (1994) 5(4):333-338), and thiazolidine forming ligation and 
oxazolidine forming ligation (Zhang, et al., Proc. Natl. Acad. Set (1998) 
95(16):9184-9189; Tarn, et al., WO 95/00846; US Patent No. 5,589,356); Gieselman 
et al., Selenocysteine-mediated native chemical ligation (Org. Lett. (2001) 

30 3(9):1331-1334); and Staudinger amide forming chemical ligation (Saxon et al., 
Org. Lett. (2000) 2:2141-2143). Thus, as will be appreciated, any chemoselective 
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reaction chemistry that can be applied to the ligation of unprotected peptide 
segments that is amenable for such purpose. 

Reaction conditions for a given ligation chemistry are selected to maintain 
the desired interaction of the peptide or polypeptide segments employed for ligation. 
5 For example, pH and temperature, water-solubility of the ligation label, ratio of the 
first segment to the second segment, water content and composition of the reaction 
mixture can be varied to optimize ' ligation. Addition or exclusion of reagents that 
solubilize the ligation segments to different extents may further be used to control 
the specificity and rate of the desired ligation reaction, i.e., control exposure and 
10 presentation of reactive groups by manipulating solubility of the peptide or 
polypeptide segments. Reaction conditions are readily determined by assaying for 
the desired chemoselective reaction product compared to one or more internal and/or 
external controls. 

Where the ligation involves the joining of a polypeptide that possesses an N- 

15 terminal cysteine residue, the procedure of native chemical ligation is preferably 
employed (Dawson, et al., Science (1994) 266:776-779; Kent, et al„ WO 96/34878; 
Kent, et al., WO 98/28434)). This methodology has proven a robust methodology 
for generating a native amide bond at the ligation site. Native chemical ligation 
involves a chemoselective reaction between a first peptide or polypeptide segment 

20 having a C-terminal a-carboxythioester moiety and a second peptide or polypeptide 
having an N-terminal cysteine residue. A thiol exchange reaction yields an initial 
thioester-linked intermediate, which spontaneously rearranges to give a native amide 
bond at the ligation site while regenerating the cysteine side chain thiol. In many 
instances, the sequence of the natural protein will comprise suitably placed cysteine 

25 residues such that polypeptide fragments having an N-terminal cysteine residue may 
be synthesized and used in a native chemical ligation reaction, hi other instances, 
the peptide synthesis can be conducted so as to introduce cysteine residues into a 
polypeptide for this purpose. Thus in its standard form, native chemical ligation 
involves thioester-mediated chemoselective reaction at a cysteine residue in a target 

30 polypeptide sequence; a peptide bond is formed at the ligation site and the side chain 
of the Cys is regenerated in native form. 



WO 02/20033 



PCT7US01/21930 



-52- 

Alternatively, the proteins of the present invention may be synthesized 
through the use of "Pseudo-Native Chemical Ligation," or "Extended Native 
Chemical Ligation. Pseudo-Native Chemical Ligation involves the use of non- 
naturally occurring pseudo-amino acid residues at preselected positions in the 

5 peptides employed in the protein synthesis (e.g., See Figure 4). The structures of 
such pseudo-amino acids mimic both the structures of cysteine and the structures of 
the amino acids that are naturally found at such preselected positions in the protein 
being synthesized. Pseudo-native chemical ligation is thus directed to the 
thioalkylation of cysteine side chains generated at ligation sites from native chemical 

10 ligation. A preferred aspect is thioalkylation of cysteine ligation sites wherein at 
least one peptide contains a native cysteine having its thiol side chain protected with 
a suitable protecting group. 

In a preferred embodiment of the invention, the thiol moiety of a cysteine 
group is modified into a desired side chain, for example, into the side chain of a 

15 ribosomally specified amino acid, an analog of such an amino acid, or into a non- 
ribosomally specified amino acid. As used herein, a ribosomally specified amino 
acid is an amino acid that is recognized by ribosomes in the process of protein 
translation and can be incorporated into a ribosomally produced protein. 
Considerable published literature exists describing chemical modifications of the 

20 cysteine side chain thiol moiety (see, e.g., "Current Protocols in Protein Science," 
Edited by: John E. Coligan et al, John Wiley & Sons, NY (2000)). Kaiser, E.T. has 
described the conversion of cysteine residue side chains to mimic the chemical 
properties of a naturally occurring amino acid side chain (see, e.g., Kaiser, E.T. et 
al, "Chemical Mutation Of Enzyme Active Sites," Science. 1984 Nov 

25 2;226(4674):505-ll). Additionally, the use of a cysteine side chain to introduce a 
label into a peptide or protein has been described. Cysteine side chain modifications 
are reviewed in Chemistry of Protein Conjugation and Crosslinking, S. S. Wong, 
(1991, CRC Press); Chemical Modification of Proteins, Gary E. Means et al, (1971, 
Holden-Day), Chemical Modification of Proteins: Selected methods and analytical 

30 procedures, Glazer, A.N. et al. (1975, Elsevier); Chemical Reagents for Protein 
Modification, RL Lundblad (1991, CRC Press). Tam et al. (Biopolymers (1998) 
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46:319-327) have disclosed the use of homocysteine (-CH 2 -CH 2 -SH) for non-cys 
native chemical ligation, followed by thioalkylation using methyl p- 
nitrobenzenesulfonate (methylating reagent) to convert the homocysteine side chain 
to a native methionine side chain (-CH2-CH2-S-CH3). The present invention also 
5 can be used for converting homocysteines to pseudo amino acids as well, i.e., to 
amino acids other than methionine. However, as with the conversion of cysteines 
described herein, in accordance with the present invention it is necessary to use 
protecting groups to avoid destruction of native cysteines involved in disulfide 
pairing for peptides that contain at least one native cysteine that one does not wish to 

1 0 convert. Suitable protecting groups are described below. 

While the method of pseudo-native chemical ligation does not facilitate the 
mimicking of the side chains of certain ribosomally-specified amino acids (e.g., the 
side chains of glycine, alanine, valine, and proline) (alanine's side chain can, 
however, be formed through a desulfurization reaction (Liang, Z.Y. and Dawson, 

15 P.E., "Synthesis of Peptides and Proteins without Cysteine Residues by Native 
Chemical Ligation Combined with Desulfurization," J. Am. Chem, Soc. 2001, 123, 
526-533, herein incorporated by reference), it may be used to form side chains that 
mimic many ribosomally-specified or non-encoded amino acids. Amino acids 
produced in accordance with the pseudo-native chemical ligation method of the 

20 present invention will contain a thioether linkage, and will have no beta-branching 
(in that they will all include a methyl group at the beta position, i.e., aa-CH 2 -S-. 
Thus, the pseudo-amino acid versions of the beta-branched amino acids, isoleucine 
and threonine can be made to have the pendant side chain structure, without having 
the beta geometry and its attendant constraints. 

25 Significantly, the methods of the present invention may be used to form 

amino acid side chains that are the same length as that of ribosomally specified 
amino acids, or are longer or shorter than such length. Such alteration in side chain 
length can be used to stabilize (or destabilize) the three-dimensional conformation to 
increase protein stability (or to enhance the ability of the protein to alter its 

30 conformation and thereby accept a different range of substrates, inhibitors, receptors, 
ligands, etc. relative to those accepted by the naturally occurring protein. For 
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example, Cys-CH 2 -SH + Br-CH 2 -COOH yields Cys-CH 2 -S-CH 2 -COOH (such 
"pseudo-glutamic acid" has one additional side chain atom, namely the -S- group; 
alternatively, if used in the place of aspartic acid, it will possess two additional side 
chain atoms, namely a -CH 2 -S- group). Other side chains have the same number of 

5 atoms in the side chain, but differ by inclusion of the thioether linkage (-S-). For 
example, Cys-CH 2 -SH + Br-CH 2 -CH 2 -NH-i>G, followed by removal of PG yields 
Cys-CH 2 -S-CH 2 -CH 2 -NH 2 . The resulting structure has no additional atoms in the 
side chain, but one -CH 2 - group is replaced with -S-. Methionine is another example 
here, Cys-CH 2 -SH + I-CH 2 -CH 3 yields Cys-CH 2 -S-CH 2 -CH 3 (versus native met 

10 structure of Met-CH 2 -CH 2 -S-CH 3 ); thus the thioether is relocated. Arginine also: 
Cys-CH 2 -SH + Br-CH 2 -NH-CH((-NH 2 )(=NH 2 + )) yields Cys-CH 2 -S-CH 2 -NH-CH((- 
NH 2 )(= NH 2 '")). Preferably, protection of reactive amino groups, particularly for the 
constructing pseudo lysine can be employed to avoid unwanted side reactions. Once 
the thioalkylation reaction is performed, the protecting group can be removed. 

15 In general, where the desire is to mimic a naturally occurring protein as 

closely as possible, it is most preferred to employ a pseudo amino acid molecule 
having a side chain length that is the same length as that of the ribosomally-specified 
amino acid normally present at such position in the protein; it is less preferred to 
employ a pseudo amino acid molecule having a side chain length that is one atom 

20 longer than that of the ribosomally-specified amino acid, and still less preferred to 
employ a pseudo amino acid molecule having a side chain length that is two atoms 
longer than that of the ribosomally-specified amino acid. Moreover, its is preferred 
to select a cysteine ligation site that is in a location where genetic changes are not 
likely to disrupt function or where amino acids at that site in related proteins are 

25 conserved. Such sites can be identified by alanine scanning, homology modeling, 
and other methods. 

In pseudo-native chemical ligation, a peptide containing an amino terminal 
cysteine residue is ligated to a peptide having a carboxy terminal thioester, as in 
native chemical ligation. The thiol side chain of the cysteine is then reacted with a 
30 compound of the formula R aa -X, where X is a good leaving group, and R aa is a group 
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whose structure mimics the terminal portion of the side chain of an ribosomally- 
specified or synthetic amino acid. 

Significantly, the reactions of pseudo-native chemical ligation work with 
either the natural L-configuration of cysteine side chain, or the D-configuration. Use 
5 of the D-configuration can impart protease resistance at that ligation site, and thus 
may be desired when increased stability to proteolysis is desired. However, in using 
the D-cysteine, the backbone structure at that site will be altered. Such alteration, in 
addition to protease resistance, may be desired to alter bioactivity. However, to 
minimize impact on bioactivity, it is preferred to locate the D-cysteine at a site of 

10 high flexibility, such as a disorded region, such as at a disorded loop that will be 
located on the surface of the resulting folded molecule, on at a disorded terminus of 
the molecule. Desirably, the reactions of pseudo-native chemical ligation may be 
used to place large, charged side chains (e.g., the side chains of Lys, Arg, Asp or 
Glu) on the surface of the synthesized molecule. 

15 Examples of suitable good leaving groups, X, include halogens, especially 

Iodine and Bromine. Examples of R aa groups include P0 4 , COOH, COO, CONH 2 , 
guanidinium, amine, alkyl, substituted alkyl, aryl, substituted aryl, imidazole, 
alkylated imidazole, indole, or alkylated indole groups. 

The selection of which R aa to employ will depend upon the amino acid side 

20 chain desired to be present at a particular position. Thus, for example, a desired 
polypeptide or protein having the amino acid sequence: 
aa N H 2 -Q-aa x -aa y -W-aacooH 
where Q and W each denote the optional presence or absence of additional amino 
acid residues, and aa x and aa y denote internal adjacent residues (having side chains x 

25 and y, respectively), and aaNH 2 and respectively denote the amino (N-) termimal 
residue and the carboxy (C-) terminal residue of the polypeptide or protein an be 
synthesized by preparing two peptide fragments: 

aa N H 2 -Q-aa x -COSR and Cys-W-aa CO oH 
where Cys denote the replacement of aa y with cysteine, and R is any group 

30 compatible with the thioester group, including, but not limited to, aryl, benzyl, and 
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alkyl groups. Examples of R include 3-carboxy-4-nitrophenyl thioesters, benzyl 
esters, and mercaptoproprionic acid leucine esters (See, e.g., Dawson et al, Science 
(1994) 266:776-779; Canne et al Tetrahedron Lett. (1995) 36:1217-1220; Kent, et 
al, WO 96/34878; Kent, et al, WO 98/28434; Ingenito et al, JACS (1999) 
5 121(49):11369-11374; and Hackeng et al, Proc. Natl. Acad. Sci. U.S.A. (1999) 
96:10068-10073). Other examples include dithiothreitol, or alkyl or aryl thioesters, 
which can be produced by intein-mediated biological techniques, which also are well 
known (See, e.g., Chong et al, Gene (1997) 192:277-281; Chong et al, Nucl. Acids 
Res. (1998) 26:5109-5115; Evans et al, Protein Science (1998) 7:2256-2264; and 
10 Cotton et al, Chemistry & Biology (1999) 6(9):247-256); and then ligating the 
fragments together to form: 

aa N H 2 -Q-aa x -Cys-W-aacooH- 
The ligated fragment is then reacted with R y -X, where R y is a side group that 
mimics the structure of the y side chain). The reaction is conducted under conditions 
15 sufficient to convert the thiol group of the cysteine into a "pscudo-y" side chain. For 
example, if R aa is selected to be CH 2 -COOH or (CH 2 )2-COOH, then the reaction will 
lead to the formation an amino acid residue that mimics the structure and function of 
aspartic acid ("pseudo-Asp") or glutamic acid ("pseudo-Glu"). As will be 
appreciated, in light of the above description, more complicated synthesis can be 
20 conducted employing more than two peptide fragments. 

A significant feature of this approach is that cysteine residues that one does 
' not wish to modify in the reacting segments be side chain protected, e.g. as 
Cys(Acm), to prevent chemical modification of Cys residues other than the one(s) at 
the ligation site(s), or that any other Cys residues in the reacting segments be 
25 intended for the simultaneous formation of identical 'pseudo amino acids' by the 
chemical modification reaction after ligation. 

As used herein, the symbol cp denotes a benzyl group; IM denotes an 
imidazole group, and IN denotes an indole group; PG denotes a protecting group. 
Below is a summary of R aa side groups that may be used to synthesize peptides 
30 containing pseudo-amino acid residues in accordance with the present invention 
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(where X is a halogen (I, Br, CI, F, with I and Br preferred for most, and F preferred 
for 9 attachment): 

Basic Amino Acids: 

Lys (no extra atoms) 
5 -CH 2 -SH + X-CH2-CH2-NH-PG, followed by deprotection 

gives -CH2-S-CH2-CH2-NH2 
Arg (no extra atoms) 

-CH2-SH + X-CH 2 -NH-C((NH 2 )(=NH 2 +)) 
gives -CH 2 -S-CH2-NH-C((NH 2 )(=NH2+)) 
10 His (2 extra atoms) 

-CH 2 -SH + X-CH 2 -/M gives -CH2-S-CH2-/M 
Acidic Amino Acids: 
Asp (2 extra atoms) 

-CH2-SH + X-CH2-COOH gives -CH 2 -S-CH 2 -COOH 
15 Glu (1 extra atoms) 

-CH 2 -SH + X-CH2-COOH gives -CH2-S-CH2-COOH 
Uncharged Polar Amino Acids: 
Tyr (no or 1 extra atom) 

-CH2-SH + F-cp-pOH gives -CH 2 -S-cp-pOH)(no extra atoms, same geometry 
20 -CH 2 -SH + Br/I-CH 2 -(p-pOH gives -CH 2 -S-CH 2 -q)-pOH (1 extra atom) 

Gin (1 extra atom) 

-CH 2 -SH + X-CH 2 -C(0)(NH 2 ) gives -CH 2 -S-CH 2 -C(0)(NH 2 ) 
Asn (2 extra atoms) 

-CH 2 -SH + X-CH 2 -C(0)(NH 2 ) gives -CH 2 -S-CH 2 -C(0)(NH 2 ) 
25 Ser (2 or 3 extra atoms) 

-CH 2 -SH + X-CH 2 OH) gives -CH 2 -S-CH 2 OH (2 extra atoms) 
-CH 2 -SH + X-CH 2 -CH 2 OH) gives -CH2-S-CH2-CH2OH (3 extra atoms) 
Thr (2 or 3 extra atoms, missing beta branching) 

-CH 2 -SH + X-CH((CH 3 )(0-PG)) followed by removal of PG 
30 gives -CH 2 -S-CH((CH 3 )(OH)) (2 extra atom) 
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-CH 2 -SH + X-CH 2 -CH((CH 3 )(0-PG)) followed by removal of PG 

gives -CH 2 -S-CH 2 -CH((CH 3 )(OH)) (3 extra atoms) 
Non-Polar Amino Acids: 
Leu (1 or 2 extra atoms) 
5 -CH 2 -SH + X-CH((CH 3 )(CH 3 )) gives -CH 2 -S-CH((CH 3 )(CH 3 )) (1 extra 

atom) 

-CH 2 -SH + X-CH 2 -CH((CH 3 )(CH 3 )) gives -CH 2 -S-CH 2 -CH((CH 3 )(CH 3 )) (2 
extra atoms) 
He (2 or 3 extra atoms, missing beta branching) 
10 -CH 2 -SH + X-CH(CH 3 )-CH 2 -CH 3 gives -CH 2 -S-CH(CH 3 )-CH 2 -CH 3 (2 extra 

atoms) 

-CH 2 -SH + X-CH 2 -CH(CH 3 )-CH 2 -CH 3 gives -CH 2 -S-CH 2 -CH(CH 3 )-CH 2 - 
CH 3 (3 extra atoms) 
Phe (1 or 2 extra atoms) 
15 -CH 2 -SH + F-9 gives -CH 2 -S-<p (1 extra atom) 

-CH 2 -SH + Br/I-CH 2 -q> gives -CH 2 -S-CH 2 -q> (2 extra atoms) 
Met (no extra atoms) 

-CH 2 -SH + I-CH2-CH3 gives -CH 2 -S-CH 2 -CH 3 
Trp (1 extra atom) 
20 -CH 2 -SH + F-IN gives -CH 2 -S-/A^ 

However, where it is either inconvenient or undesirable to modify a protein 
sequence so as to introduce a cysteine or homocysteine residue at a given N-terminus 
of a polypeptide utilized for ligation, or utilize a pseudo amino acid at the ligation 
site, the method of native chemical ligation may be extended using polypeptides 
25 whose N-terminus has been modified to contain an N-substituted, and preferably, 
Na-substituted, 2 or 3 carbon chain amino alkyl or aryl thiol, and thereby permit the 
principles of native chemical ligation to be employed with polypeptides lacking 
cysteine residues (see, U.S. Patent Application Serial No. 60/231,339, herein 
incorporated by reference). 
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The method of "Extended Native Chemical Ligation" involves ligating a first 
component comprising a carboxyl thioester, and more preferably, an cc-carboxyl 
thioester with a second component comprising an acid stable N-substituted, and 
preferably, Na-substituted, 2 or 3 carbon chain amino alkyl or aryl thiol. 
5 Chemoselective reaction between the carboxythioester of the first component and the 
thiol of the N-substituted 2 or 3 carbon chain alkyl or aryl thiol of the second 
component proceeds through a thioester-linked intermediate, and resolves into an 
initial ligation product. More specifically, the thiol exchange occurring between the 
COSR thioester component and the amino alkyl thiol component generates a 
10 thioester-linked intermediate ligation product that after spontaneous rearrangement 
generates an amide-linked first ligation product through a 5-membered or 6- 
membered ring intermediate depending upon whether the amino alkyl thiol 
component has formula I or II, respectively: 



where J 1 is a peptide or polypeptide having one or more optionally protected amino 
acid side chains, or a moiety of such peptide or polypeptide, a polymer, a dye, a 
suitably functionalized surface, a linker or detectable marker, or any other chemical 
moiety compatible with chemical peptide synthesis or extended native chemical 

20 ligation; Rl, R2 and R3 are independently H or an electron donating group 
conjugated to CI; with the proviso that at least one of Rl, R2 and R3 comprises an 
electron donating group conjugated to CI; and J2 is a peptide or polypeptide having 
one or more optionally protected amino acid side chains, or a moiety of such peptide 
or polypeptide, a polymer, a dye, a suitably functionalized surface, a linker or 

25 detectable marker; or any other chemical moiety compatible with chemical peptide 
synthesis or extended native chemical ligation. 

The N-substituted 2 or 3 carbon chain alkyl or aryl thiol [HS-C2-C1(R1)-] or 
[HS-(C3(R3)-C2(R2)-C1(R1>] at the ligation site is amenable to being removed, 
under peptide-compatible conditions, without damage to the product, to generate a 

30 final ligation product of formula III, having a native amide bond at the ligation site: 



J 1 -C(0)-N(C 1 (Rl )-C2-SH)-J2 

J 1 -C(0)-N(C 1 (Rl )-C2(R2)-C3(R3)-SH)-J2 



I 



15 



II 



J1-C(0)-HN-J2 



III 
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where Jl, J2, Rl, R2, and R3 are as defined above. 
The Rl, R2 and R3 groups are selected to facilitate cleavage of the N-Cl 
bond under peptide compatible cleavage conditions. For example, electron donating 
groups, particularly if conjugated to CI, can be used to form a resonance stabilized 
cation at CI that facilitates cleavage. The chemical ligation reaction preferably 
includes as an excipient a thiol catalyst, and is carried out around neutral pH 
conditions in aqueous or mixed organic-aqueous conditions. Chemical ligation of 
the first and second components may proceed through a five or six member ring that 
undergoes spontaneous rearrangement to yield an N-substituted amide linked 
ligation product. Where the first and second components are peptides or 
polypeptides, the N-substituted amide linked ligation product has formula IV or V: 
J 1 -C(0)-N<x(Cl (R1)-C2-HS)-CH(Z2)-C(0)-J2 IV 
J 1 -C(0)-Ncc(C 1 (Rl )-C2(R2)-C3(R3)-HS)-CH(Z2)-C(0)-J2 V 
where Jl, J2 and Rl, R2, R3 and Z2 are as defined above 
The conjugated electron donating groups Rl, R2 or R3 of the N-substituted 
amide bonded ligation product facilitate cleavage of the N-Cl bond and removal of 
the 2 or 3 carbon chain alkyl or aryl thiol from the N-substituted amide-linked 
ligation product. Removal of the alkyl or aryl thiol chain of the N under peptide- 
compatible cleavage conditions generates a ligation product having a native amide 
bond at the ligation site. Where the first and second components are peptides or 
polypeptides, the ligation product will have the formula: 

J 1 -CONaH-CH(Z2)-C(0)-J2 X 
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Exemplary Rl subtituents for Formula I are depicted in Table I. 
Table I 



Formula 1 



HNa CH C- 

\ 1! 



ft 1 Snhsritnent Gronns for Formula 1 (CI included for referenced 
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Exemplary Rl, R2 and R3 substituents for Formula II are depicted in Table 



HNa CH C J 2 



c 3 —c{ 

/ \ 

R 3 K 2 



Rl. R9. and R^ Siiihstirnents (C.\ inr.lndeH for rpihrenr.e:") 



C 3 — C 2 




As with the N-substituted 2 carbon chain compounds, positioning of the 
benzyl and picolyl electron-donating substituents Rl', R3' and R5' in the ortho or 
para positions is necessary to maintain electronic conjugation to the CI carbon for 
robust cleavage of the Na-Cl bond following ligation. However, when R2 and R3 
form a benzyl group with C2 and C3, at least one of Rl' and R3' comprises a strong 
electron donating group, where Rl' or R3' is selected from methoxy (-OCH3), thiol 
(-SH), hydroxyl (-OH), and thiomethyl (-SCH3). For the N-substituted 3 carbon 
chain thiols in which R2 and R3 are hydrogens, Rl comprises a benzyl or picolyl 
group in which Rl', R3' and R5' include either strong or moderate electron-donating 
groups, or a combination thereof. As with the N-substituted 2 carbon chain alkyl or 
aryl thiols, the strong electron-donating groups enhance the sensitivity of the 3 
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carbon chain alkyl or aryl thiol to cleavage following ligation. Thus a particular 
electron-donating group or combination thereof can be selected accordingly. 

Similar to the N-substituted 2 carbon chain compounds, the N-substituted 3 
carbon chain compounds of the present invention may include a thiol as a substituent 
5 of Rl in the Rl' and R5' positions when available for substitution in a construct of 
interest. Here again the electron-donating thiol group is conjugated to CI and its 
introduction at these locations enables the compounds to have two routes for the 6- 
member ring forming ligation event. It also increases the local concentration of 
available thiols for reacting with the a-carboxy thioester, and provides for additional 

1 0 conformations in terms of structural constraints that can improve ligation. 

Synthesis of the N-terminal N-substituted 2 or 3 carbon chain alkyl or aryl 
thiol amino acids of the invention can carried out as described herein, for example, 
in Scheme I and Scheme II, and in accordance with standard organic chemistry 
techniques known in the art. See, e.g., "Advanced Organic Chemistry, Reactions, 

15 Mechanisms, and Structure," 4 th Edition, J. March (Ed.), John Wiley & Sons, New 
York, NY, 1992; "Comprehensive Organic Transformations, A Guide to Functional 
Group Preparations," R. Larock (Ed.), VCH Publishers, New York, NY, 1989. They 
may be synthesized in solution, by polymer-supported synthesis, or a combination 
thereof. The preferred approach employs N alpha protected N alkylated S-protected 

20 amino alkyl- or aryl- thiol amino acid precursors. The reagents utilized for synthesis 
can be obtained from any number of commercial sources. Also, it will be well 
understood that the starting components and various intermediates, such as the 
individual amino acid derivatives can be stored for later use, provided in kits and the 
like. 

25 In preparing the N-terminal Na-substituted 2 or 3 carbon chain alkyl or aryl 

thiol amino acids of the invention, protecting group strategies are employed. The 
preferred protecting groups (PG) utilized in the various synthesis strategies in 
general are compatible with Solid Phase Peptide Synthesis ("SPPS"). In some 
instances, it also is necessary to utilize orthogonal protecting groups that are 

30 removable under different conditions. Many such protecting groups are known and 
suitable for this purpose (See, e.g., "Protecting Groups in Organic Synthesis", 3rd 
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Edition, T.W. Greene and P.G.M. Wuts, Eds., John Wiley & Sons, Inc., 1999; 
NovaBiochem Catalog 2000; "Synthetic Peptides, A User's Guide," G.A. Grant, Ed., 
W.H. Freeman & Company, New York, NY, 1992; "Advanced Chemtech Handbook 
of Combinatorial & Solid Phase Organic Chemistry," W.D.. Bennet, J.W. 
5 Christensen, L.K. Hamaker, M.L. Peterson, M.R.Rhodes, and H.H. Saneii, Eds., 
Advanced Chemtech, 1998; "Priciples of Peptide Synthesis, 2nd ed.," M. 
Bodanszky, Ed., Springer-Verlag, 1993; "The Practice of Peptide Synthesis, 2nd 
ed.," M. Bodanszky and A. Bodanszky, Eds., Springer-Verlag, 1994; and "Protecting 
Groups," P.J. Kocienski, Ed., Georg Thieme Verlag, Stuttgart, Germany, 1994). 

10 Examples include benzyloxycarbonyl (Z), Boc, Bpoc, Trt, Nps, FmocCl-Z, Br-Z; 
NSC; MSC, Dde, etc. For sulfur moieties, examples of suitable protecting groups 
include, but are not limited to, benzyl, 4-methylbenzyl, 4-methoxybenzyl, trityl, 
ACM, TACAM, xanthyl, disulfide derivatives, picolyl, and phenacyl. 

More particularly, the Ncc-substituted 2 or 3 carbon chain alkyl or aryl thiols 

15 can be prepared in accordance with Scheme I (Solid-Phase preparation of the Noc- 
substituted precursor), Scheme II (Solution-Phase preparation of the Ncc-substituted 
precursor ). In Scheme I, Na-substituted 2 or 3 carbon chain alkyl or aryl thiols are 
assembled directly on the solid phase using standard methods of polymer-supported 
organic synthesis, while the Noc-protected, N-alkylated, S-protected, aminoalkyl or 

20 arylthiol amino acid precursor of Scheme II are coupled to the resin using standard 
coupling protocols. Where racemic or diastereomeric products are produced, it may 
be necessary to separate these by standard methods before use in extended native 
chemical ligation. 

25 
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The a-carboxythioesters can be generated by chemical or biological methods 
following standard techniques known in the art, such as those described herein, 
including the Examples. For chemical synthesis, a-carboxythioester peptides can be 
synthesized in solution or from thioester-generating resins, which techniques are 
5 well known (See, e.g., Dawson et al., Science (1994) 266:776-779; Canne et al. 
Tetrahedron Lett. (1995) 36:1217-1220; Kent, et al., WO 96/34878; Kent, et al., WO 
98/28434; Ingenito et al., JACS (1999) 121(49):1 1369-1 1374; and Hackeng et al, 
Proc. Natl. Acad. Sci. U.S.A. (1999) 96:10068-10073); Amiato et al, supra.). For 
instance, chemically synthesized thioester peptides can be made from the 

10 corresponding peptide a-thioacids, which in turn, can be synthesized on a thioester- 
resin or in solution, although the resin approach is preferred. The peptide-cc- 
thioacids can be converted to the corresponding 3-carboxy-4-nitrophenyl thioesters, 
to the corresponding benzyl ester, or to any of a variety of alkyl thioesters. All of 
these thioesters provide satisfactory leaving groups for the ligation reactions, with 

15 the 3-carboxy-4-nitrophenyl thioesters demonstrating a somewhat faster reaction rate 
than the corresponding benzyl thioesters, which in turn may be more reactive than 
the alkyl thioesters. As another example, a trityl-associated mercaptoproprionic acid 
leucine thioester-generating resin can be utilized for constructing C-terminal 
thioesters (Hackeng et al, supra). C-terminal thioester synthesis also can be 

20 accomplished using a 3-carboxypropanesulfonamide safety-catch linker by 
activation with diazomethane or iodoacetonitrile followed by displacement with a 
suitable thiol (Ingenito et al, supra; Bertozzi et al.). 

C-terminal a-carboxythioester peptides also can be made using biological 
processes, such as intein-mediated biological techniques (See, e.g., Chong et al, 

25 Gene (1997) 192:277-281; Chong et al, Nucl. Acids Res. (1998) 26:5109-5115; 
Evans et al. Protein Science (1998) 7:2256-2264; and Cotton et al. Chemistry & 
Biology (1999) 6(9):247-256). For instance, intein expression systems, with or 
without labels such as affinity tags can be utilized to exploit the inducible self- 
cleavage activity of an 'intein' protein-splicing element to generate a C-terminal 

30 dithiothreitol (DTT) ester peptide or polypeptide segment. In particular, the intein 
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undergoes specific self-cleavage in the presence of thiols such as DTT, b- 
mercaptoethanol or cysteine, which generates a peptide segment bearing a C- 
terminal thioester. See, e.g., Chong et al., (1997) supra; Chong et all, (1998) supra; 
Evans et al., supra; and Cotton et al. 5 supra. However, when using a recombinantly 
5 produced segment, the chemically synthesized portion of the final construct will 
typically contain the polymer modification where included. 

Ligation of the N-substituted 2 or 3 carbon chain alkyl or aryl thiol 
components of the invention with the first carboxythioester component generates a 
ligation product having an N-substituted amide bond at the ligation site. The 

10 ligation conditions of the reaction are chosen to maintain the selective reactivity of 
the thioester with the N-substituted 2 or 3 carbon chain alkyl or aryl thiol moiety. In 
a preferred embodiment, the ligation reaction is carried out in a buffer solution 
having pH 6-8, with the preferred pH range being 6.5-7.5. The buffer solution may 
be aqueous, organic or a mixture thereof. The ligation reaction also may include one 

15 or more catalysts and/or one or more reducing agents, lipids, detergents, other 
denaturants or solubilizing reagents and the like. Examples of preferred catalysts are 
thiol and phosphine containing moieties, such as thiophenol, benzylmercaptan, 
TCEP and alkyl phosphines. Examples of denaturing and/or solubilizing agents 
include guanidinium, urea in water or organic solvents such as TFE, HTTP, DMF, 

20 NMP, acetonitrile admixed with water, or with guanidinium and urea in water. The 
temperature also may be utilized to regulate the rate of the ligation reaction, which is 
usually between 5°C and 55°C, with the preferred temperature being between 15°C 
and 40°C. As an example, the ligation reactions proceed well in a reaction system 
having 2% thiophenol in 6M guanidinium at a pH between 6.8 and 7.8. 

25 For the N-substituted 2 carbon chain alkyl or aryl thiols, the ligation event 

results from a thiol exchange that occurs between the COSR thioester component 
and the amino alkyl thiol component. The exchange generates a thioester-linked 
intermediate ligation product that after spontaneous rearrangement through a 5- 
membered ring intermediate generates a first ligation product of the formula Jl-HN- 

30 CH(Zl)-C(0)-Na(Cl(Rl)-C2-SH)-CH(Z2)-J2 having a removable N-substituted 2 
carbon chain alkyl or aryl thiol [HS-C2-C1(R1)-] at the ligation site, where the 
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substituents are as defined above. The N-substituted 2 carbon chain alkyl or aryl 
thiol [HS-C2-C1(R1)-] at the ligation site is amenable to being removed, under 
peptide-compatible conditions, to generate a final ligation product of the formula Jl- 
HN-CH(Zl)-CO-NH-CH(Z2)-CO-J2 having a native amide bond at the ligation site. 
5 For the N-substituted 3 carbon chain aryl or alkyl thiols, the thiol exchange 

between the COSR thioester component and the amino alkyl thiol component 
generates a thioester-linked intermediate ligation product that after spontaneous 
rearrangement through a 6-membered ring intermediate generates a first ligation 
product of the formula Jl-HN-CH(Zl)-C(0)-Na(Cl-C2(R2)-C3(R3)-SH)-CH(Z2)- 

10 J2 having a removable N-substituted 3 carbon chain alkyl or aryl thiol [HS-C3(R3)- 
C2(R2)-C1(R1)-] at the ligation site. The N-substituted 3 carbon chain aryl thiol 
[HS-C3(R3)-C2(R2)-C1(R1)-] at the ligation site is amenable to being removed, 
under peptide-compatible conditions, to generate a final ligation product of the 
formula Jl-HN-CH(Zl)-CO-NH-CH(Z2)-CO-J2 having a native amide bond at the 

15 ligation site. 

Removal of the N-substituted alkyl or aryl thiol group is preferably 
performed in acidic conditions to facilitate cleavage of the N-Cl bond, yielding a 
stabilized, unsubstituted amide bond at the ligation site. By "peptide-compatible 
cleavage conditions" is intended physical-chemical conditions compatible with 

20 peptides and suitable for cleavage of the alkyl or aryl thiol moiety from the ligation 
product. Peptide-compatible cleavage conditions in general are selected depending 
on the a-substituted compound employed, which can be readily deduced through 
routine and well known approaches (See, e.g., "Protecting Groups in Organic 
Synthesis", 3rd Edition, T.W. Greene and P.G.M. Wuts, Eds., John Wiley & Sons, 

25 Inc., 1999; NovaBiochem Catalog 2000; "Synthetic Peptides, A User's Guide," G.A. 
Grant, Ed., W.H. Freeman & Company, New York, NY,1992; "Advanced Chemtech 
Handbook of Combinatorial & Solid Phase Organic Chemistry," W.D.. Bennet, J.W. 
Christensen, L.K. Hamaker, M.L. Peterson, M.R.Rhodes, and H.H. Saneii, Eds., 
Advanced Chemtech, 1998; "Priciples of Peptide Synthesis, 2nd ed.," M. 

30 Bodanszky, Ed., Springer-Verlag, 1993; "The Practice of Peptide Synthesis, 2nd 
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ed.," M. Bodanszky and A. Bodanszky, Eds., Springer-Verlag, 1994; and "Protecting 
Groups," P.J. Kocienski, Ed., Georg Thieme Verlag, Stuttgart, Germany, 1994). 

For example, where the Rl, R2 or R3 substituents comprises a methoxy, 
hydroxyl, thiol or thiomethyl, methyl and the like, the more universal method for 
5 removal involves acidic cleavage conditions typical for peptide synthesis 
chemistries. This includes cleavage of the N-Cl bond under strong acidic conditions 
or water-acidic conditions, with or without reducing reagents and/or scavenger 
systems (e.g., acid such as anhydrous hydrogen fluoride (HF), triflouroacetic acid 
(TFA), or trimethylsulfonyl fiouroacetic acid (TMSFA) and the like). More specific 

10 acidic cleavage systems can be chosen to optimize cleavage of the Na-Cl bond to 
remove the aryl or alkyl thiol moiety for a given construct. Such conditions are well 
known and compatible with maintaining the integrity of peptides. A thiol scavenger 
may be included, particularly where tryptophans are present in a peptide or 
polypeptide sequence to avoid reaction of the tryptophan side chain with the 

15 liberated aryl or alkyl thiol moiety. Examples of thiol scavengers include ethanediol, 
cysteine, beta-mercaptoethanol and thiocresol. . 

Other specialized cleavage conditions include light or reductive-cleavage 
conditions when the picolyl group is the substituent. As an example, when the Rl, 
or R2 and R3 substituents comprise a picolyl moiety, photolysis (e.g., ultraviolet 

20 light), zinc/acetic acid or electrolytic reduction may be used for cleavage following 
standard protocols. Where Rl of the N-substituted 2 carbon chain thiol comprises a 
thiomethane at Rl, the mercury or HF cleavages can be used. The cleavage system 
also can be used for simultaneous cleavage from a solid support and/or as a 
deprotection reagent when the first or second ligation components comprise other 

25 protecting groups. 

In one embodiment of the present invention Na-substituted 2 or 3 chain alkyl 
or aryl thiols are employed in the peptide synthesis step (particularly in automated 
peptide synthesis and orthogonal and convergent ligation strategies) to yield properly 
N-terminally derivatized polypeptides that can be used as substrates for extended 

30 chemical ligation to yield synthetic bioactive proteins. Such compounds comprise a 
fully protected, partially protected or fully unprotected acid stable Na-substituted 2 
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or 3 carbon chain amino alkyl or aryl thiol of the formula (PG1)S-C2-C1(R1)- 
Na(PG2)-CH(Z2)-C(0) -J2 or (PG1)S-C3(R3)-C2(R2)-C l(Rl)-Na(PG2)-CH(Z2)- 
C(0)-J2 3 which are depicted below in Table III and Table IV. In particular, one or 
more of Rl, R2 and R3 comprises an electron donating group conjugated to CI that, 
5 following conversion of the Not-substituted amino alkyl or aryl thiol to an Nct- 
substituted amide alkyl or aryl thiol, is capable of forming a resonance stabilized 
cation at CI that facilitates cleavage of the Na-Cl bond under peptide compatible 
cleavage conditions. PG1 and PG2 are protecting groups that are present 
individually or in combination or are absent and can be the same or different, where 

10 Z2 is any chemical moiety compatible with chemical peptide synthesis or extended 
native chemical ligation, and where J2 is any chemical moiety compatible with 
chemical peptide synthesis or extended native chemical ligation. PG1 (or XI) is a 
group for protecting the amine. PG2 (or X2) is a group for protecting the thiol. 
Many such protecting groups are known and suitable for this purpose (See, e.g., 

15 "Protecting Groups in Organic Synthesis", 3rd Edition, T.W. Greene and P.G.M. 
Wuts, Eds.., John Wiley & Sons, Inc., 1999; NovaBiochem Catalog 2000; "Synthetic 
Peptides, A User's Guide," G.A. Grant, Ed., W.H. Freeman & Company, New York, 
NY,1992; "Advanced Chemtech Handbook of Combinatorial & Solid Phase Organic 
Chemistry," W.D.. Bennet, J.W. Christensen, L.K. Hamaker, M.L. Peterson, 

20 M.RRhodes, and H.H. Saneii, Eds., Advanced Chemtech, 1998; "Priciples of 
Peptide Synthesis, 2nd ed.," M. Bodanszky, Ed., Springer-Verlag, 1993; "The 
Practice of Peptide Synthesis, 2nd ed.," M. Bodanszky and A. Bodanszky, Eds., 
Springer-Verlag, 1994; and "Protecting Groups," P.J. Kocienski, Ed., Georg Thieme 
Verlag, Stuttgart, Germany, 1994). 
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Examples of preferred protecting grouj 


)S for PG1 and XI include, but are 



limited to [Boc(t-Butylcarbamate), Troc(2,2,2,-Trichloroethylcarbamate), Fmoc(9- 
Fluorenylmethylcarbamate), Br-Z or Cl-Z(Br- or Cl-Benzylcarbamate), Dde(4,4,- 
dimethyl-2,6-dioxocycloex 1 -ylidene), MsZ(4-Methy]sulfinylbenzylcarbamate), 

5 Msc(2-Methylsulfoethylcarbamate) Nsc(4-nitrophenylethylsulfonylethyloxy- 
carbonyl]. Preferred PG1 and XI protecting groups are selected from "Protective 
Groups in Organic Synthesis," Green and Wuts, Third Edition, Wiley-Interscience, 
(1999) with the most preferred being Fmoc and Nsc. Examples of preferred 
protecting groups for PG2 include, but are not limited to [Acm(acetamidomethyl), 

10 MeoBzl or Mob(p-Methoxybenzyl), Meb(p-Methylbenzyl), Trt(Trityl), 
Xan(Xanthenyl), tButhio(s-t-butyl), Mmt(p-Methoxytrityl), 2 or 4 PicolyI(2 or 4 
pyridyl)), Fm(9-Fluorenylmethyl), tbut(t-Butyl), Tacam(Trimethylacetamidomethyl)] 
Preferred protecting groups PG2 and X2 are selected from "Protective Groups in 
Organic Synthesis," Green and Wuts, Third Edition, Wiley-Interscience, (1999), with 

15 the most preferred being Acm, Mob, MeB, Picolyl . 
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The protected forms of the Na-substituted 2 or 3 chain alkyl or aryl thiols of 
the invention can be prepared as in Schemes I and II above. 

The compounds of the present invention may be produced by any of a variety 
5 of means, including halogen-mediated amino alkylation, reductive amination, and by 
the preparation of Na-protected, N -alkylated, S-protected, amino alkyl- or aryl- thiol 
amino acid precursors compatible with solid phase amino acid synthesis methods. 
When desirable, resolution of the racemates or diastereomers produced to give 
compounds of acceptable chiral purity can be carried out by standard methods. 

10 III. Preferred Water-soluble Polymers of the Invention and Production 

As used herein, the term "molecular heterogeneity" is intended to refer to a 
variation in the number of polymer molecules attached to each protein of a protein 
preparation. The term "molecular diversity" is intended to refer to (a) a variation in 
the site(s) of the protein that are modified by the polymer, (b) a variation in the 
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length of the polymer adducts of different sites of the same protein, or of the same 
site(s) in different proteins of a protein preparation, or (c) a variation in the extent 
and/or nature of any branching of the polymer adducts of different sites of the same 
protein, or of the same site(s) in different proteins of a protein preparation. As used 

5 herein, the term "molecularly homogeneous" is intended to refer to a preparation of a 
protein in which all of the protein molecules contain the amino acid sequence and 
the same polymer modifications at the same positions. A mixture of two or more 
"molecularly homogeneous" protein preparations is referred to herein as a 
"molecularly defined" preparation. 

10 In accordance with this aspect of the invention, a solution to the above- 

identified problems of polymer heterogeneity, diversity, and unsuitability involves 
the production of a new class of biocompatible polymers which combine the 
advantages of both polypeptides (precise length, convenient synthesis) and "pPEG" 
("precision PEG"), a flexible, amphiphilic, non-immunogenic, polymer not 

15 susceptible to proteases) Rose, K. et al. (U.S. Patent Application Serial No. 
09/379,297, herein incorporated by reference). This new class of biocompatible 
polymer has the formula: 

-[CO-X-CO-NH-Y-NH] n - 

n is an integer, preferably from 1-100 and more preferably from 2-100, where X and 
20 Y are biocompatible repeat elements of precise structure linked by an amide bond. 
Preferably, X and Y will be divalent organic radicals lacking reactive functional 
groups or are absent and are the same or different, and can vary independently with 
each repeating unit (n). Preferably, when n=2, at least one of X or Y will be selected 
from the group consisting of a substituted, unsubstituted, branched and linear 
25 aliphatic and aromatic group. More preferably, one of X or Y will be selected from 
the group consisting of phenyl, a Cj-Cio alkylene moiety, a C1-C10 alkyl group, a 
heteroatom-containing phenyl, a heteroatom -containing C^-Cio alkylene moiety, a 
heteroatom-containing Cj-Ciq alkyl group, and a combination thereof. 
Particularly preferred pPEG moieties have the formulae: 
30 -{CO-(CH 2 ) 2 -CO- NH-(CH 2 )3-(OCH 2 CH2)3-CH2-NH} n - 
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where n preferably varies from 1-100 and more preferably from 2-100; or 
-{CO-(CH 2 )2-CO-NH-(CH 2 )6-NH-CO-(CH2)2-CO-NH-(CH 2 )3-(OCH2CH 2 )3-CH 2 -NH-} n - ) 
where n preferably varies from 1-50 and more preferably from 2-50. 

Such pPEG moieties can be synthesized in any of a variety of ways. Such 

5 moieties are, however, preferably produced using a solid phase stepwise chain 
assembly of units, rather than a polymerization process. The use of such an 
assembly process permits the moieties of a preparation to have a defined and 
homogeneous structure, as to their length, the nature of their X and Y substituents, 
the position(s) (if any) of branch points, and the length, X and Y substituents, and 

1 0 position(s) of any branches. 

Preferably, such moieties will be synthesized by steps such as: 

(a) acylating the amino or hydroxyl group of a compound of the formula 
Z-Q-support with a molar excess of a derivative of a diacid having 
the formula, HOOC-X-COOH, where Z is H2N- or HO-; Q is a linker 

15 or a target molecule; and the support is a solid phase, matrix or 

surface; 

(b) activating the free carboxyl group of the product of step(a); 

(c) aminolysing the product of step (b) with a molar excess of a diamine 
having the formula, NH2-Y-NH2; and 

20 (d) optionally repeating steps (a) - (c) using HOOC-X-COOH and NH2- 

Y-NH2, where said X and Y are divalent organic radicals or are 
absent and are the same or different, and can vary independently with 
each of said optionally repeated units, and are the same or different 
from the X and Y substituents used in any of the previous acylating 

25 and aminolysing steps. 

In preferred embodiments, 6-mers, 12-mers, 18-mers and 32-mers of above 
repeat unit are employed. Where desired, the repeat unit can be used, for example, 
in conjunction with the amino group of lysine to form branched pPEG structures. 
The pPEG may be attached to the synthetic proteins of the present invention by a 
30 variety of chemistries, including thioether, oxime and amide linkage formation. 
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In one embodiment, the solid phase stepwise chain assembly of units comprises: 

Step 1: Couple protected or unprotected diacid to amino on resin to generate 

amide bond at linkage site (where PG is a protecting group that is present or absent 

depending on diacid employed): 
5 PG-OOC-X-COOH + NH 2 -Y-NH-Resin 

Step 2: Remove protecting group (PG) on resin, if present 
HOOC-X-CO-NH-Y-NH-Resin 

Step 3: Couple protected or unprotected diamino to carboxy on resin to 
generate amide bond at linkage site (where PG is present or absent depending on 
10 diamino employed) 

PG-NH-Y-NH + HOOC-X-CO-NH-Y-NH-Resin 
Step 4: Remove protecting group (PG) on resin, if present 
-NH-Y-NH-OC-X-CO-NH-Y-NH-Resin 

Step 5: Repeat steps 1-4 ! n' times to add 'n' units then cleave from resin 
15 -[CO-X-CO-NH-Y-NH]-[CO-X-CO-NH-Y-NH]-[CO-X-CO-NH-Y-NH]-[CO-X-CO-NH-Y-NH]- 

As discussed, linear and branched pPEG constructs are preferred water- 
soluble polymers for attachment to the synthetic bioactive molecules of the 
invention. The pPEGs employed bear pendant groups that are charged or neutral 
under physiological conditions, and can be made to vary in attachment chemistry, 

20 length, branching and solubility depending on the pPEG structure one employs. As 
noted above, preferred pPEGs of the invention comprise a water-soluble polyamide 
having the repeat unit -CO-X-CO-NH-Y-NH-, where one or both of X and Y 
comprises a water-soluble repeat unit, and most preferably a 'PEG'-based repeat 
unit. Although oligoureas are in principle accessible using a simple two-step solid 

25 phase procedure, as shown below for the case of an amino resin, NH2-Resin, and a 
symmetrical diamine, NH2-Y-NH2: 

Activation with carbonyldiimidazole -> im-CO-NH-Resin 
Aminolysis with diamine NH 2 -Y-NH 2 NH 2 -Y-NH-CO-NH-Resin 
where these two steps may be repeated a number of times to give an oligourea with 

30 repeat unit -NH-Y-NH-CO-, this approach is less preferred. This is because the 
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yields of the above steps may be non-quantitative at room temperature, even with 
very large excesses of reagents and long reaction times. Accordingly, it is preferable 
to use a three-step solid phase procedure, shown below for the case of an amino 
resin, NH2-Resin, to form a polyamide. The Reagents HO2C-X-CO2H and NH2-Y- 
5 NH2 should be symmetrical in order to avoid isomeric products. 

Acylation with diacid HO2C-X-CO2H ■» HO-CO-X-CO-NH-Resin 
Activation with carbonyldiimidazole -> im-CO-OCO-X-CO-NH-Resin 
Aminolysis with diamine NH2-Y-NH2 NH 2 -Y-NH-CO-X-CO-NH-Resin 
These three steps may be repeated in sequence a number of times to give a 

10 polyamide with repeat unit -NH-Y-NH-CO-X-CO-. The polymer contains a precise 
number of monomer units, X and Y can be varied independently at each step, and 
end-groups can be chosen at will. For example, by using succinic anhydride 
("Succ") for the acylation step and 4,7,10-trioxa-l,13-tridecanediamine (also 
referred to as "EDA" or "TTD") for the aminolysis step, 'PEG'-based polyamides 

15 are formed wherein X is -CH 2 CH 2 -, Y is -NH-CH 2 CH2CH2-(OCH 2 CH2)3-CH2- 
NH- and the repeat unit is -NH-CH2CH 2 CH2-(OCH 2 CH2)3-CH2-NH- 
COCH2CH2CO-. In spite of the fact that the procedure involves divalent reagents 
with no protecting groups, cross-linking is not a problem when standard commercial 
peptide synthesis resins are used (Rose et al., (U.S. Patent Application Serial No. 

20 09/379,297); and Rose et al., J. Am. Chem. Soc. (1999) 121:7034), except as noted 
below for the case of branched Lys cores for making branched constructs. 

For example, branched pPEG constructs can be made in a similar manner as 
for the "chemobody" constructs described in Rose et al., (U.S. Patent Application 
Serial No. 09/379,297) and Rose, K. and Vizzavona, J. (J. Am. Chem. Soc. (1999) 

25 121:7034). Thus, branched pPEG constructs can be made to have a branching core 
such as a lysine branching core, which is coupled through oxime linkers to a 
preferred water-soluble polyamide such as -(COCF^C^CO-NH-CB^CH^CF^- 
(OCH2CH2)3-CH2-NH) n -. For instance, oxime bonds can readily formed between 
an aminooxyacetyl group on a Lys side chain at the other extremity of the 
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polyamide, and glyoxylyl groups on a lysine core, for example the tetrameric core 
(0=CHCO)4Lys2Lys-. Thus an oxime linker off of each lysine branch point can be 
prepared as the structure -Lys(COCH20N=CHCO-)amide-. An alternative 
construction can place the oxime bond between the polyamide and the free pendant 
5 group of the polyamide, preferably using a monoprotected diamine and deprotection 
after coupling the polyamide, to generate a tetravalent branched constructed depicted 
below: 

[0=CHCO-(NH-CH 2 CH2CH2-[OCH2CH2]3-CH2-NH-COCH2CH 2 CO--) n ]4Lys2Lys- 
Oxime chemistry can be used in making not only dimeric and tetrameric 

10 branched constructs, but it also suitable for assembling octameric branched 
constructs (Rose, K., J. Am. Chem. Soc. (1994) 116:30; Rose, K. et al, Bioconj. 
Chem. (1996) 7:552). It is, of course, possible to use other chemistries for such 
purposes when using such pPEG polyamides (See, e.g., Figure 15). Moreover, 
polyamide formation may be incorporated into a synthetic scheme for peptide 

15 synthesis involving Boc or Fmoc chemistry, but when elaborating such schemes it 
must be borne in mind that the aminolysis step will remove the Fmoc group, and will 
remove formyl protection of indole if Boc chemistry is to be used. 

Accordingly, such pPEGs can be made to have various branching cores (e.g., 
Lysine branching core etc.) linked through a bond of choice (e.g., amide, thioether, 

20 oxime etc.) to a linear water-soluble polyamide (e.g., -(Succ-TTD),,-, etc.), where 
the free end of each linear water-soluble polyamide can each be capped with a 
desired pendant group (e.g., carboxylate, amino, amide etc.) to provide a desired 
charge. Moreover the linear and branched pPEGs of the present invention can be 
made to comprise a unique functional group for attachment to a synthetic protein 

25 bearing one or more unique and mutually reactive functional groups, and the pendant 
groups of the pPEGs will preferably be non-antigenic (See, e.g., Figure 15). 

In a particularly preferred embodiment, the invention is directed to 
molecularly homogeneous glyco-mimetic water-soluble polymers of the formula: 



U-sl-B-s2~Polymer-s3-J* 
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where U is a residue of a unique functional group, B is a branching core having three 
or more arms that may be the same or different and may be present or absent, 
Polymer is a polyamide having a molecular weight greater than about 5,000 Da of 
the formula -[C(0)-X-C(0)-NH-Y-NH]n- or -[NH-Y-NH-C(0)-X-C(0)]n-, where 
5 X and Y are divalent radicals that may be the same or different and may be branched 
or linear, and n is a discrete integer from 2 to 50, and where either or both of X and 
Y comprises a substantially non-antigenic water-soluble repeat unit that may be 
linear or branched, J* is a residue of a substantially non-antigenic pendant group 
having a net charge under physiological conditions selected from the group 

10 consisting of negative, positive and neutral, and where si, s2, and s3 are spacer or 
linker moieties that may be the same or different, and may be individually present or 
absent. Such polymers include those described in US. Patent Application Serial No. 
60/236,377, which is incorporated herein in its entirety. Formula U-sl-B-s2- 
Polymer-s3-J* also can be represented by the formula U-B-Polymer-J*, where 

15 spacer or linker groups si, s2, s3 may be present or absent. 

The preferred process of forming the preferred glycomimetic polymers of the 
invention are illustrated in Figures 5-7. Figures 5A - 5B depict solid phase 
process for generating the branching core (B) and unique chemoselective functional 
group (U) of the water-soluble polymer U-B-Polymer-J* of the invention. The 

20 process may be carried out in solution, although the solid phase approach as shown 
is preferred. In particular, Figure 5A shows orthogonally protected U-B precursor 
moiety with reactive group ( ^ ) and the basic geometric structure of such 
construct is depicted by dots linked with bonds ( • — • ); this basic geometric 
structure is not intended to limit that types of chemical linkages or groups employed, 

25 but merely illustrative of the relative points of geometry for building structures and 
chemical elaboration points suitable for generation of U-B moieties of the invention. 
The orthogonally protected U-B precursor is coupled to a polymer support/resin 
comprising a suitable cleavable linker and co-reactive group ( V ) following 
activation that is capable of covalent linkage to the U-B precursor: 
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Polymer Support 



Linker 

This system employs the principles of polymer-supported organic chemistry. 
Following coupling the branching core is elaborated (only a first branch point 
shown, and additional branch points may be present or absent) as illustrated to 

5 generate a branching core that is suitable for subsequent attachment of a desired 
Polymer component, such as a substantially non-antigenic, water-soluble linear 
polymer. Also shown is the U group, which can be provided at the outset of 
synthesis as part of the orthogonally protected U-B precursor, or elaborated during or 
after elaboration of the branching core B. While attachment of the Polymer 

10 component can be achieved on-resin, i.e., prior to cleavage, a preferred route is 
cleavage of the U-B moiety from the polymer support / resin so as to generate a U-B 
core that can be purified for subsequent attachment the Polymer component. 

As illustrated, the pendant branch points of core group B are built to 
comprise a functional group (Func), which can be the same or different, and may be 

15 reversibly protected (PG, PG' or PG") or unprotected. In each case, the final step 
for attachment of the Polymer component involves the generation of a functional 
group (Fund) at the pendant branch points, and generation of group U (See Figure 
7). Figure 5B depicts an alternative process in which a protected U-B precursor is 
employed in combination with a polymer support bearing a linker, which upon 

20 cleavage generates the desired protected or unprotected U-group. Figure 5B also 
depicts attachment of a pre-assembled branching core B to a polymer support, and 
use of a U-group generating resin to make the U-B moiety for subsequent attachment 
of the Polymer component. 

Figures 6A - 6D depict a solid phase process for generating preferred 

25 substantially non-antigenic water-soluble polyamide Polymer-J* components of the 
invention for subsequent attachment to the U-B core. Although the solid phase 
process is illustrated, which is the preferred process, a solution phase process can be 
adapted to achieve the same end result. In Figures 6 A and 6B, diacid and diamino 
units are coupled using the principles of solid phase organic chemistry. Optionally, 
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protected amino-X'-acid and /or amino-Y'-acid units can be incorporated for 
additional diversity of the groups X and Y in the final cleavage product having a 
polyamide structure of the formula -[NH-Y-NHCO-X-CO]-. Figure 6A depicts 
synthesis in the N- to C-terminal direction, whereas Figure 6B depicts synthesis in 

5 the C- to N-terminal direction. In Figures 6C and 6D, protected amino-X'-acid and 
/or amino-Y'-acid units are coupled using the principles of solid phase organic 
chemistry. Figure 6C depicts synthesis in the N- to C-terminal direction, whereas 
Figure 6D depicts synthesis in the C- to N-terminal direction. As apparent from 
Figures 6A-6D, the nature of the final polyamide products can be precisely 

10 controlled, which depends on the number of cycles one carries out for synthesis. 
Moreover, the pendant group J* can be built to virtually any user-defined 
specification. Where mono-disperse repeat units X, Y, X' and Y' are employed, the 
exact molecular structure of the final polyamide product can be precisely controlled. 
A preferred process for coupling the U-B component to Polymer-J* 

1 5 component to generate the preferred synthetic polymer constructs of the invention of 
the formula U-B-Polymer-J* is depicted in Figure 7. As illustrated, various 
protecting groups can be provided, and are optional depending on the intended end 
use of a given construct. Also illustrated different routes to produce the desired U- 
B-Polymer-J* constructs, including a solid phase approach and a solution phase 

20 approach. 

As noted above, preferred water-soluble repeat units comprise a polyalkylene 
oxide, polyamide alkylene oxide, or derivatives thereof. The most preferred water- 
soluble repeat unit comprises an ethylene oxide of the formula -(CH 2 -CH 2 -0)- or - 
(CH2-CH2-O)-. The number of such water-soluble repeat units can vary 

25 significantly, but the more preferred number of such units is from 2 to 500, 2 to 400, 
2 to 300, 2 to 200, 2 to 100, 2 to 50, and more preferably 2 to 32, with 3 to 8 being 
the most preferred. An example of a more preferred embodiment is where one or 
both of X and Y is selected from: -((CH 2 )ni-(CH2-CH 2 -0) n2 -(CH 2 )ni-)- or-((CH 2 )„i- 
(0-CH 2 -CH 2 ) n2 -(CH2)ni-), where nl is 1 to 6, 1 to 5, 1 to 4 and most preferably 1 to 

30 3, and where n2 is 2 to 50, 2 to 25, 2 to 15, 2 to 10, 2 to 8, and most preferably 2 to 
5. An example of a highly preferred embodiment is where X is -(CH 2 -CH 2 )-, and 



WO 02/20033 



PCT7US01/21930 



-82- 

where Y is -(CH 2 -(CH 2 -CH 2 -0) 3 -CH 2 -CH 2 -CH 2 )- or -(CH 2 -CH 2 -CH 2 -(0-CH 2 - 
CH 2 ) 3 -CH 2 )-. Even more preferred are mono-disperse compositions where each n of 
is a discrete integer. In a highly preferred embodiment, X is -(CH 2 -CH 2 )- and Y is - 
(CH 2 -(CH 2 -CH 2 -0) 3 -CH 2 -CH 2 -CH 2 )n- or -(CH 2 -CH2-CH 2 -(0-CH 2 -CH 2 )3-CH 2 )-n 
5 where n is 12 to 36. 

In particular, the Polymer will preferably have when the water-soluble 
polymer contains polymer chains having polyalkylene oxide repeat units, such as 
ethylene oxide repeat units, it is preferred that each chain have a molecular weight of 
between about 1500 and about 42,000 Da and preferably between about 2,000 to 
1 0 about 20,000 Da being the most preferred. 

A preferred U group comprises a moiety capable of chemical ligation. 
Preferred polymers are branched where group B comprises three or more arms. A 
preferred component J* comprises a group that is ionizable under physiological 
conditions, and most preferably one that is negatively charged. Either or both of U 
15 and J may comprise a protecting group that is capable of being removed without 
damaging the integrity of the construct. 

Preferred spacers or linkers include linear or branched moieties comprising 
one or more repeat units employed in a water-soluble polymer, diamino and or 
diacid units, natural or unnatural amino acids or derivatives thereof, as well as 
20 aliphatic moieties, including alkyl, aryl, heteroalkyl, heteroaryl, alkoxy, and the like, 
which preferably contain up to 18 carbon atoms or even an additional polymer chain. 

As noted above, component U is a residue of a functional group that is 
capable of being attached or is attached to a target molecule, such as a peptide or 
polypeptide, or other surface of interest. When U is a residue of a functional group 
25 for conjugation to a target molecule, U comprises a nucleophilic group or 
electrophilic group, and the target molecule comprises a mutually reactive 
electrophilic group or nucleophilic group, respectively. 

Many such mutually reactive functional groups are known and are capable of 
attachment to side chain functional groups common to peptides and polypeptides, or 
30 derivatized side chain functional groups (Zalipsky et al, Bioconjugate Chemistry 
(1995) 6:150-165; "Perspectives in Bioconjugate Chemistry", C.F. Meares, Ed., 
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ACS, 1993; "Chemistry of Protein Conjugation and Cross-Linking", S.S. Wong, Ed., 
CRC Press, Inc. (1993)). Examples of functional groups include groups capable of 
reacting with an amino group such as (a) carbonates such as the p-nitrophenyl, or 
succinimidyl; (b) carbonyl imidazole; (c) azlactones; (d) cyclic imide thiones; and 
5 (e) isocyanates or isothiocyanates. Examples of functional groups capable of 
reacting with carboxylic acid groups and reactive carbonyl groups include (a) 
primary amines; or (b) hydrazine and hydrazide functional groups such as the acyl 
hydrazides, carbazates, semicarbamates, thiocarbazates, aminooxy etc. Functional 
groups capable of reacting with mercapto or sulfhydryl groups include phenyl 

10 glyoxals, maleimides, and halogens. Examples of functional groups capable of 
reacting with hydroxyl groups such as (carboxylic) acids, or other nucleophiles 
capable of reacting with an electrophilic center, include hydroxyl, amino, carboxyl, 
thiol groups, active methylene and the like. 

For instance, the above polymer can be prepared to carry component U as a 

15 functional group for attachment to a target molecule, where the functional group is 
acrylate, aldehyde, ketone, aminooxy, amine, carboxylic acid, ester, thioester, 
halogen, thiol, cyanoacetate, dipalmitoyl phosphatidylethanolamine, distearoyl 
phosphatidylethanolamine, epoxide, hydrazide, azide, isocyanate, maleimide, 
methacrylate, nitrophenyl carbonate, orthopyridyl disulfide, silane, sulfhydryl, vinyl 

20 sulfones, succinimidyl glutarate, succimidyl succinate, succinic acid, tresylate and 
the like. U also may be provided in an activatable form, e.g., a carboxylic acid that 
can be converted to an active ester thereof that is capable of reacting with a 
nucleophile such as an amine. 

In a preferred embodiment, U is a residue of a unique functional group that is 

25 selectively reactive with a unique functional group on the target molecule. This 
aspect of the invention embodies the principles of peptide synthesis (protecting 
group strategies) and chemical ligation (partial or no protecting group strategies) as 
discussed above in Section II. Thus, U can represent a residue of a wide range of 
functional groups, such as those described above. Preferred U groups are amenable 

30 to amine capture strategies (e.g., ligation by hemiaminal formation, by imine 
formation, and by Michael addition), thiol capture strategies (e.g., ligation by 
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mercaptide formation, by disulfide exchange), native chemical ligation strategies 
(e.g., ligation by thioester exchange involving cysteine or thiol contain side-chain 
amino acid derivative), and orthogonal ligation coupling strategies (e.g., ligation by 
thiazolidine formation, by thioester exchange, by thioester formation, by disulfide 
5 exchange, and by amide formation)(See, e.g., Coltart, DM., Tetrahedron (2000) 
56:3449-3491). A particularly preferred U comprises a residue of a unique functional 
group employed in an aqueous compatible chemical ligation chemistry such as 
described above in Section II. Accordingly, U can be a functional group capable of 
forming a bond selected from the group consisting of carbonate, ester, urethane, 

10 orthoester, amide, amine, oxime, imide, urea, thiourea, thioether, thiourethane, 
thioester, ether, thaizolidine, hydrazone, oxazolidine and the like. Preferred bonds 
are oxime and amide bonds. 

As noted above, B is a branching core moiety having three or more arms, and 
may be present or absent. When B is present, one arm is joined to U or a spacer or 

15 linker attached to U, and each other arm is joined to a Polymer or a spacer or linker 
attached to a Polymer. Examples of branching cores B include, but are not limited 
to, amino, amide, carboxylic, and combinations thereof. These include oligoamides 
of lysine and the like, or oligomers prepared from alkanediamines and acrylic acid 
("polyamidoamines"), the later providing a net positive charge in the branching core. 

20 (See, e.g., Zeng et al., J. Pept. Sci. (1996) 2:66-72; Rose et al, Bioconjugate Chem., 
(1996) 7(5):552-556; NovoBiochem Catalog and Peptide Synthesis Handbook, 
Laufelfmgen, 2001; Wells et al, Biopolymers (1998) 47:381-396; Mahajan et al, 
(1999) 40:4909-49-12; Judson et al. (1998) 39:1299-1302; Swali et al, (1997) 
62:4902-4903; US Patent Nos. 5,932,462, 5,919,455, 5,643,575, 5,681,567). Many 

25 other different branching cores can be used and are suitable for this purpose, 
including substituted diamines, substituted diacids, alkyl acids such as glycerol and 
other moieties having three or more functional or activatable groups including 
multivalent alkyl, aryl, heteroalkyl, heteroaryl, and alkoxy moities and the like, and 
oligosaccharides (e.g., Nierengarten et al., Tetrahedron Lett. (1999) 40:5681-5684; 

30 Matthews et al, Prog. Polym. Sci. (1998) 1-56; Suner et al, Macromolecules (2000) 
33:253; Fischer et al, Angew. Cltem. Int. Ed. (1999) 38:884; Sunder et al, Adv. 
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Mater. (2000) 12:235; Mulders et al, Tetrahedron Lett. (1997) 38:631-634; Mulders 
et al, Tetrahedron Lett. (1997) 38:30-3088; and Turnbull et al, Chembiocehm 
(2000) l(l):70-74). The most preferred branching core linked to the Polymer 
component is joined by amide bonds. 
5 Preferred branching cores emanate from amino, carboxylic or mixed amino 

and carboxylic functionalities. Examples of preferred branching cores are lysine 
amino cores, aspartic or glutamic acid branching cores, and gamma-glutamic acid 
branching cores, respectively. Moreover, preferred branched polymer constructs are 
those in which branching emanates from a single branching core, such as an 
10 oligoamide or polyamidoamine core. However, other classes of branched constructs 
can be employed, such as worm-like structures in which the branching emanates 
from a sequence of multiple branching cores distributed along a polymer backbone 
(Schliiter et al, Chem. Int. Ed. (2000) 39:864). Depending on the chemical 
composition and/or the bulkiness of polymer backbone and repeat units, the resulting 
15 worm-like structures can be designed to be water soluble or prone to induce liquid 
crystalline organization (Ouali et al, Macromolecules (2000) 33:6185), which can 
be advantageous for delivery applications, stability and duration of action. As with 
the other branched polymer constructs of the invention, the worm-like constructs are 
made to contain a pendant functional group comprising U. 
20 The Polymer component, or one or more of the spacers or linkers, may 

include polymer chains or interspaced linkers or bonds that are biostable or 
biodegradable, for example as discussed above in Section I. As also noted above in 
Section I, component J* is a residue of pendant group having a net charge under 
physiological conditions selected from the group consisting of negative, positive and 
25 neutral. The most preferred is where J* comprises an ionizable carboxylate moiety 
and has a net negative charge under physiological conditions. The components si, 
s2, and s3 are spacer or linker moieties that may be the same or different, and may be 
individually present or absent, such as those discussed in Section I. Most preferably 
the spacer or linker comprises a polymer chain, where the spacer or linker comprises 
30 one or more repeat units of the formula -[CO-X-CO-NH-Y-NH] n ._ 
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IV. Preferred Polymer-Modified Synthetic Bioactive Proteins of the 
Invention 

As noted above, polymer-modified synthetic cytokines and chemotactic 
cytokines comprise particularly preferred examples the invention. More preferred 
5 cytokines are glycoproteins. Many such cytokines are glycoproteins and include, for 
example, many of the interleukins, interferons, growth factors, chemokines and the 
like. By way of example, cytokines such as EPO, TPO, GCSF, GM-CSF, IL-1, IL-2, 
IL-3, IL-4, IL-5, IL-6, IL-7, IL-9, IL-10, IL-12, IL-13, IL-14, IL-15, IFa-II, , IF-y, 
LIL, IF-beta, nerve growth factor, and Rantes and many other chemokines are 
10 glycoproteins, and the glycosylation sites of these and other such proteins are known. 

EPO is a particularly preferred example of a protein whose amino acid 
sequence can be used to aid in the design of a polymer-modified synthetic bioactive 
protein having erythropoiesis-stimulating activity. Colony-stimulating factors and 
RANTES also comprise particularly preferred examples of proteins whose amino 
15 acid sequence can be used to aid in the design of a synthetic bioactive protein within 
the ambit of the present invention. 

EPO is the principal factor responsible for the regulation of red blood cell 
production during steady-state conditions and for accelerating recovery of red blood 
cell mass following hemorrhage (Jelkmann, W. (1992) Physiol. Reviews 72:449; 
20 Krantz, S.B. (1991) Blood 77:419; Porter, D.L. and M.A. Goldberg (1993) Exp. 
Hematol. 21:399; Nissenson, A.R. (1994) Blood Purif. 12:6). The circulating form 
of human EPO is a 165 amino acid (aa) glycoprotein with a molecular weight of 
approximately 30,000 (Sawyer, S.T. et al. (1994) Hematol. Oncol. Clinics NA 
8:895; Jacobs, K.J. et al. Nature (1985) 313:806; Lin, F-K. et al. Proc. Natl. Acad. 
25 Sci. USA (1985) 82:7580). Although the cDNA for EPO predicts a molecule with 
166 amino acid residues, the carboxy-terminal arginine is removed in a post- 
translational modification. There are three potential sites for N-linked glycosylation 
and all are filled. One O-linked carbohydrate moiety is also present. The effects of 
glycosylation are complex. Although unglycosylated E. coli-derived EPO shows full 
30 biological activity in vitro, glycosylation is apparently necessary for full activity in 
vivo. Thus, E. coli-produced and deglycosylated, naturally derived EPO show very 
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low activity in animal studies (Krantz, S.B. (1991) Blood 77:419; Sasaki, H. et al. 
(mi) J. Biol. Chem. 262:12059; Lowy, P.H. et al. (1960) Nature 185:102; 
Wojchowski, D.M. et al. Biochem. Biophys. Acta 910:224; Dordal, M.S. et al. 
(1985) Endocrinology 116:2293), 
5 Consistent with the above, variable glycosylation patterns show variable 

effects. For example, desialyated EPO exhibits both enhanced in vitro and decreased 
in vivo activity, an effect attributed to the exposure of galactose residues which are 
recognized, bound, and cleared by hepatocytes (Spivak, J.L. and B.B. Hogans (1989) 
Blood 73v:90; Goldwasser, E. et al. (1974) J. Biol. Chem. 249:4202). The branching 
10 pattern of fully sialyated EPO also makes a difference in biological activity. 
Predominantly tetra-antennary branched EPO shows activity equivalent to 
"standard" EPO, while predominantly bi-anternnary branched EPO shows three-fold 
more activity in vitro but only 15% of normal activity in vivo (Takeuchi, M. et al. 
(1989) Proc. Natl. Acad. Sci. USA 86:7819). Studies have indicated that only N- 
15 linked, and not O-linked sugars, are important in EPO functioning (Higuchi, M. et 
al. (1992) J. Biol. Chem. 267:770319). 

In a particularly preferred embodiment, the synthetic bioactive proteins of the 
present invention comprise a polymer-modified synthetic erythropoiesis stimulating 
protein. As used herein, an erythropoiesis stimulating protein is a protein that 
20 mediates the production of red blood cells. The erythropoiesis stimulating 
bioactivity of a protein may be determined by any of a variety of means, such as by 
following erythropoiesis after in vivo administration, by assaying the capacity of the 
protein to mediate the proliferation of EPO-dependent cell lines, etc. 

A preferred erythropoiesis stimulating protein is mammalian, most preferably 
25 human, EPO and its analogs. (See. e.g., U.S Patent Nos. 5,856,298; 5,955,422; 
8,888,772; 5,858,347; 5,614,184; 5,457,089; and 6,077,939; and WOO /32772). 
Erythropoietin is predominantly synthesized and secreted by tubular and juxtatubular 
capillary endothelial and interstitial cells of the kidney. Chronical kidney disease 
causes the destruction of EPO-producing cells in the kidney. The resulting lack of 
30 EPO frequently induces anemias. The main clinical use of EPO is therefore the 
treatment of patients with severe kidney insufficiency (hematocrit below 0.3) who 
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usually also receive transfusions as well as the treatment of anemic patients 
following chemotherapy. Typically, EPO is synthesized recombinantly in hamster 
ovary cells for clinical use. EPO is a relative heat- and pH-stable acidic (pi =4.5) 
protein of 165 amino acid residues in its mature form. EPO conveys its activity 
5 through the EPO-receptor. Approximately 40 percent of the molecular mass of EPO 
is due to its glycosylation. Glycosylation is an important factor determining the 
pharmacokinetic behavior of EPO in vivo. Non-glycosylated EPO has an extremely 
short biological half-life. It still binds to its receptor and may even have a higher 
specific activity in vitro. However, recent experiments have shown that PEGylation 

10 of EPO significantly enhances its lifetime, but to significantly decrease EPO activity 
in a cell-based assay system. 

The synthetic erythropoiesis stimulating proteins ("SEPs") of the present 
invention are preferably polymer-modified synthetic analogs of human urinary EPO. 
In a preferred embodiment, they contain one or more polymer moieties (most 

15 preferably pPEG moieties) attached to one or more peptide residue(s) through a 
thioether, oxime, amide or other linkage. In a highly preferred embodiment, such 
linkages will be at one or more .of the positions that are naturally glycosylated in 
human urinary EPO (i.e., positions 24, 38, 83 and 126). Most preferably, the SEP 
molecules will have pPEG moieties at two or more of such positions. In an 

20 alternative embodiment, other protein residues may be modified by polymer 
moieties. The positions of modification for the synthetic bioactive proteins of the 
present invention include residues located in a disordered loop, region or domain of 
the protein, or at or near sites of potential protease cleavage. For example, polymer 
modifications may be introduced at one or more positions of 9, 69 and/or 125 of 

25 EPO. The total molecular weight of the SEP molecules of the present invention may 
vary from about 25 to about 150 kDa, and more preferably from about 30 to about 80 
kDa. The molecular weight can be controlled by increasing or decreasing the 
number and structure of the polymer (such as pPEG) utilized for modification of a 
given analog. The pPEG mediated hydrodynamic MW for the larger constructs is 

30 greater than about 100 kDa. In vitro assays in cells expressing the human EPO 
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receptor indicate that SEPs of the present invention have an ED50 that is equivalent 
to that of recombinantly produced glycosylated human EPO. 

Oxime-linked pPEG analogs are preferably constructed by attaching an 
aminooxy, ketone or aldehyde functionalized pPEG to the SEP protein at a non- 
5 naturally encoded amino acid bearing a side chain aminooxy, ketone or aldehyde 
functionality. For example, positions 89 and 117 of SEP-0 and 1 contain pseudo 
glutamates (a non-naturally encoded amino acid bearing a side chain of the formula - 
CHa-CH2-S-COOH (as compared to glutamate side chain -CHct-CH2-CH2- 
COOH)). SEP analogs utilizing thioether linkages are preferably constructed to 

10 contain a thiol functionality provided by a cysteine or unnatural amino acid with a 
side chain bearing the thiol. Figure 10 depicts the basic structure of one type of 
preferred synthetic erythropoiesis stimulating protein. 

In an alternative embodiment, the SEP molecules of the present invention 
may comprise "circularly permuted" EPO analogs in which the natural amino and 

15 carboxy terminus of EPO have been relocated. Most preferably, such relocation will 
move the amino and carboxy termini to positions of low structural constraint, such 
as to disordered loops, etc. The disordered loop around positions 125 and 126 
(relative to the native EPO residue numbering system) is an example of such a 
relocation site. Most preferably, such SEPs will be disulfide free, and will be 

20 chemically modified to contain polymer moieties at preselected residues. 

Alternatively, the SEP molecules may have amino and carboxy termini 
relocated to a naturally occurring glycosylation site, or to other sites amenable to 
glycosylation, such as position 126 and 125. The SEP molecules may also include 
modifications of the amino and carboxy termini to eliminate or modify charge (such 

25 as by carboxy amidation, etc.). In a preferred example of such circularly permuted 
molecules, new N- and C-termini are provided by positions 126 and 125, 
respectively. The natural disulfide-forming cysteines at positions 7, 29, 33 and 161 
are preferably replaced by the non-naturally encoded amino acid, L-a-N-butyric acid 
(Aba), which cannot form disulfide bridges. Residues R166, E37 and V82 are 

30 preferably replaced with alanines to improve production. Also, an additional 
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cysteine is preferably inserted between positions 1 and 166 relative to the native 
EPO number scheme, which is numbered below as '0'. The resulting molecule 
contains four cysteines (at positions 126, 0, 38, and 83 (as read in the N- to C- 
terminal direction)), which are utilized as (1) ligation sites and (2) thioether-forming 
5 pPEG attachment sites. Optionally, a cysteine may replace A125 to provide an 
additional pPEG attachment site. 

The total molecular weight of such SEP molecules of the present invention 
may vary from about 25 to about 150 kDa, and more preferably from about 30 to 
about 80 kDa. The molecular weight can be controlled by increasing or decreasing 

10 the number and structure of the polymer (such as pPEG) utilized for modification of 
a given analog. The pPEG mediated hydrodynamic MW for the larger constructs is 
greater than 100 kDa. Optional pPEG attachment sites are located at positions 125, 
9, and 24 (as read in the N- to C-terminal direction). Additional SEP analog designs 
have alternative N- and C-termini in the disordered loop region, and/or truncate 

15 residues from the new N- and/or C-termini. The basic structure of preferred 
circularly permuted molecules is shown in Figure 11. 

For example, a sequence was designed by circularly permuting the native 
sequence of human EPO in the following manner: the natural N-terminal Ala 1 and 
the C-terminal Arg 166 were joined by an additional Cys residue, thus giving a 

20 polypeptide of 167 amino acids; new N- and C-terminals were created by disjoining 
the chain at residuesl25-126 of the native sequence; all the native Cys residues were 
replaced with L- D amino-«-butyric acid residues; and the following substitutions 
were made: Glu37Ala; Asn38Cys; Val82AIa; Asn83Cys; Serl26Cys; Argl66Ala 
(all residue numbers based on the native sequence of human EPO). Together, these 

25 design elements result in the amino acid sequence of SEP-5 as shown. The SEP-5- 
L28 protein was polymer-modified at Cys residues 126, 0, 24, 38, and 83 
(numbering based on the human EPO sequence) with a maleimide-modified linear 
(TTD-Succ)g carboxylate polymer construct, by a Michael addition reaction. These 
changes were designed to improve synthesis and handling of the SEP-5-L28 protein. 

30 The designed sequence was made from four peptide segments, each with an N- 
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terminal Cys residue. The chemical ligation sites were Cys°, Cys 38 , Cys 83 . Peptides 
were synthesized where the C-terminal peptide was an D-carboxylate; the other three 
peptides were D-thioesters, and the N-terminal Cys residues were thus side chain 
Acm protected. Cys 24 was side chain unprotected. The segments were joined 
5 sequentially by native chemical ligation, starting with the C-terminal two segments. 
After the ligation, the free Cys side chain at the ligation site was reacted with the 
maleimide-modified linear (TTD-Succ)^ carboxylate polymer construct, by a 
Michael addition reaction, then the Cys side chain Acm protecting group was 
removed, and ligation of the next segment and polymer modification performed in 

10 similar fashion. After removal of the Acm group, ligation of the next (fourth) 
segment was performed, the final Acm group was removed and polymer 
modification was performed in similar fashion giving the circularly-permuted, 
polymer-modified SEP construct described above and shown in Figure 38. 

In an alternative embodiment, the synthetic bioactive proteins of the present 

15 invention are a mammalian, most preferably human C-GSF. G-CSF induces the 
rapid proliferation and release of neutrophilic granulocytes to the blood stream, and 
thereby provides therapeutic effect in fighting infection. The human GCSF protein 
is 174 amino acids in length (Patent: EP 0459630 (Camble,R. et al.\ and variants of 
its sequence have been isolated. (See, e.g., U.S. Patent Nos. 6,004,548; 4,810,643; 

20 5,218,092; 5,214,132; 5,416,195; and 5,580,755) The protein has four cysteine 
residues and an O-glycosylation site at threonine position 133. 

In one embodiment of such a synthetic GCSF molecules, the amino acid 
sequence of the molecule can be altered, relative to the sequence of native GCSF, so 
as to contain natural, or more preferably, non-naturally encoded, hydrophobic 

25 residues at one or more of positions 1, 2 or 3. Optionally, such molecules will be 
further modified to contain an additional natural, or more preferably, non-naturally 
encoded, hydrophobic residues at position 5 of GCSF, and/or at positions 173 and/or 
174. 

In a further preferred embodiment the synthetic GCSF molecules will be 
30 polymer-modified (preferably pPEG) at one or more of positions 63, 64 and/or 133 
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and/or at one or more of positions 1 and 174. The pPEG polymers may be linear or 
branched, charged, uncharged, or mixed; and may have a molecular weight range of 
from about 5 to about 80 kDa, and more preferably, from about 40 to about 70 kDa 
pPEG total MW contribution depending on the number and structure of pPEGs 
5 utilized for modification. The hydrodynamic radius exhibits a larger MW effect in 
vivo (for example, a 10 kDa branched pPEG exhibits an effective MW of about 55 
kDa). The estimated pPEG mediated hydrodynamic MW is greater than 100 kDa. 

For analogs utilizing oxime linkage, the pPEG is preferably attached to non- 
naturally encoded residues bearing a side chain aminooxy, ketone or aldehyde 

10 functionality. For analogs utilizing thioether linkage, the pPEG is preferably 
attached to a cysteine or unnatural amino acid with a side chain bearing a thiol. For 
analogs utilizing amide linkage, the pPEG is attached to a natural or unnatural amino 
acid bearing a reactive side chain amine. 

The basic structure of the synthetic bioactive GCSF proteins is shown in 

15 Figure 12. In the figure, "J" designates a non-naturally encoded residue having a 
hydrophobic side chain. 

The bioactivity of such polymer-modified synthetic bioactive GCSF proteins 
may be assayed by conventional means, such as by assaying for their capacity to 
mediate the proliferation of a factor-dependent human or mouse cell line (e.g., 

20 NFS60 cell line), or by measuring neutrophil stimulation, half-life, and 
immunogenicity in vivo, either with or without a delivery system targeting >20 day 
half-life/release). 

In an alternative embodiment, the synthetic bioactive proteins of the present 
invention are a mammalian, most preferably human chemokine, RANTES. 

25 RANTES blocks entry of M-tropic FflV strains through the primary receptor CCR5, 
and also down-modulates inflammation pathways involved in asthma, transplant 
rejection and wound healing (Kalinkovich A, el al, Immunol Lett. (1999) 68:281- 
287). The synthetic RANTES molecules of the present invention preferably differ 
from the RANTES chemokine in being chemically modified such that: (1) the N- 

30 terminal serine found at position 1 of RANTES 1-68 is substituted with an n- 
nonanoyl ("NNY") group; and (2) the tyrosine at position 3 is substituted with a non- 
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naturally encoded amino acid having a hydrophobic side chain. Such compounds 
have been found to be extremely potent, possessing ED50 in the pM range, 
compared to the mM range for recombinant "Met-RANTES 1-68". 

Potent RANTES analogs have been created having one or more additional 
5 changes to those noted above, such as replacement of the proline at position 2 with a 
non-naturally encoded amino acid having a hydrophobic side chain, and by the 
attachment of a fatty acid to the C-terminus of the molecule. Receptor specificity 
has been improved in combination with potency by modifications to the N-loop 
region corresponding to RANTES residues 12-20. In a further preferred 

10 embodiment, the synthetic RANTES molecules of the invention incorporate a 
polymer moiety (such as pPEG, or nonanoy ("NNY") or Y3X at their N- or C- 
terminus to improve inter alia in vivo half-life. The pPEG moiety is preferably 
attached through an oxime linkage to a non-naturally encoded amino acid that is 
introduced at position 66, 67, 68 or linker at position 68, with preference to position 

15 67, which is a region responsible for aggregation region. The fatty acid is preferably 
attached through an oxime linkage (preferably through a linker) to residue 68, such 
as an amino-oxy modified amino acid di- or tri-peptide linker. Other attachment 
chemistries may alternatively be employed. The molecular weight of the larger 
constructs of such synthetic RANTES analogs ranges from about 25 kDa to about 45 

20 kDa depending upon the nature and structure of the pPEG attachment, and for the 
larger constructs have a pPEG mediated hydrodynamic MW of greater than 60 kDa. 
The structure of preferred synthetic RANTES analogs is shown in Figure 13. 

V. Pharmaceutics 

The polymer-modified synthetic bioactive proteins of the present invention 
25 may be employed as pharmaceutical agents to effect the treatment of diseases and 
conditions. Most preferably, when administered to a patient or individual in need of 
therapy, such synthetic bioactive polypeptides and/or proteins will be administered 
using a drug delivery system. The uses such a system enables a drug to be presented 
to the patient in a manner that makes it acceptable for them and enhances the 
30 effectiveness of the desired bioactivity. The purposes of the present invention, 
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preferred drug delivery systems include systems capable of administering 
polypeptides or proteins buyout oral, nasal, or inhalation routes, or intramuscularly, 
subcutaneously, transdermally, intravenously, intraurally or intraocularly. 

Such drug delivery systems may include formulations that provide site- 
5 specific release, or that enhance protection for the intestinal mucosa. Suitable 
formulations include: dry powder formulations, delivery via viral particle 
encapsulation, liposome encapsulation, transdermal patches, electrically aided 
transport (electroporation therapy) and polymer/niazone conjugation. 

In a preferred embodiment, such drug delivery devices will respond to 

10 changes in the biological environment and deliver — or cease to deliver — drugs based 
on these changes. A range of materials have been employed to control the release of 
drugs and other active agents.: poly(urethanes), poly(siloxanes), poly(methyl 
methacrylate), poly(vinyl alcohol) for hydrophilicity and strength, poly(ethylene), 
polyvinyl pyrrolidone), poly(2-hydroxy ethyl methacrylate), poly(n-vinyl 

15 pyrrolidone), poly(methyl methacrylate), polyvinyl alcohol), poly(acrylic acid), 
polyacrylamide, poly(ethylene-co-vinyl acetate), polyethylene glycol), 
poly(methacrylic acid), etc. In a further preferred embodiment, biodegradeable 
polymers will be employed to facilitate drug delivery. Such polymers incude 
polylactides (PLA), polyglycolides (PGA), poly(lactide-co-glycolides) (PLGA), 

20 polyanhydrides, and polyorthoesters. Drug delivery devices, and methods for their 
use are described in U.S. Patents ; 6,072,041; 6,041,253; 6,018,678; 6,017,318; 
6,002,961; 5,879,712; 5,849,331; 5,792,451; 5,783,212; 5,766,633; 5,759,566; 
5,690,954; 5,681,811; 5,654,000; 5,641,511; 5,438,040; 4,810,499; and 4,659,558. 
Accordingly, another aspect of the invention relates to pharmaceutical 

25 compositions and methods of treating a mammal in need thereof by administering 
therapeutically effective amounts of compounds comprising the polymer-modified 
synthetic bioactive proteins of the invention, or pharmaceutically acceptable salts 
thereof. By "pharmaceutically acceptable salt" is intended to mean a salt that retains 
the biological effectiveness and properties of the polypeptides of the invention and 

30 which are not biologically or otherwise undesirable. Salts may be derived from 
acids or bases. Acid addition salts are derived from inorganic acids, such as 
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hydrochloric acid, hydrobromic acid, sulfuric acid (giving the sulfate and bisulfate 
salts), nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, malic acid, malonic acid, 
succinic acid, maleic acid, fumaric acid, tartaric acid, citric acid, benzoic acid, 
5 cinnamic acid, mandelic acid, methanesulfonic acid, ethanesulfonic acid, salicylic 
acid, p-toluenesulfonic acid, and the like. Base addition salts may be derived from 
inorganic bases, and include sodium, potassium, lithium, ammonium, calcium, 
magnesium salts, and the like. Salts derived from organic bases include those 
formed from primary, secondary and tertiary amines, substituted amines including 

10 naturally-occurring substituted amines, and cyclic amines, including isopropylamine, 
trimethylamine, diethylamine, triethylamine, tripropylamine, ethanolamine, 2- 
dimethylaminoethanol, tromethamine, lysine, arginine, histidine, caffeine, procaine, 
hydrabamine, choline, betaine, ethylenediamine, glucosamine, N-alkylglucamines, 
theobromine, purines, piperazine, piperidine, N-ethylpiperidine, and the like. 

15 Preferred organic bases are isopropylamine, diethylamine, ethanolamine, piperidine, 
tromethamine, and choline. 

The term "treatment" as used herein covers any treatment of a disease in a 
mammal, particularly a human, and includes: (i) preventing the disease from 
occurring in a subject which may be predisposed to the disease but has not yet been 

20 diagnosed as having it; (ii) inhibiting the disease, i.e. arresting its development; or 
(iii) relieving the disease, i.e. causing regression of the disease. 

By the term "a disease state in mammals that is prevented or alleviated by 
treatment with a protein antagonist or agonist" as used herein is intended to cover all 
disease states which are generally acknowledged in the art to be usefully treated with 

25 protein inhibitors in general, and those disease states which have been found to be 
usefully prevented or alleviated by treatment with the specific compounds of the 
invention. These include, by way of illustration and not limitation, asthma, allergic 
rhinitis, atopic dermatitis, viraf diseases, atheroma/atheroschleosis, rheumatoid 
arthritis and organ transplant rejection. 

30 As used herein, the term "therapeutically effective amount" refers to that 

amount of a polymer-modified synthetic bioactive protein which, when administered 
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to a mammal in need thereof, is sufficient to effect treatment (as defined above), for 
example, as an anti-inflammatory agent, anti-asthmatic agent, an 
immunosuppressive agent, or anti-autoimmune disease agent to inhibit viral 
infection in mammals. The amount that constitutes a "therapeutically effective 
5 amount" will vary depending on the protein derivative, the condition or disease and 
its severity, and the mammal to be treated, its weight, age, etc., but may be 
determined routinely by one of ordinary skill in the art with regard to contemporary 
knowledge and to this disclosure. As used herein, the term "q.s." means adding a 
quantity sufficient to achieve a stated function, e.g., to bring a solution to a desired 

10 volume (e.g., 100 mL). 

The polymer-modified synthetic bioactive proteins of this invention and their 
pharmaceutically acceptable salts, i.e., the active ingredient, are administered at a 
therapeutically effective dosage, i.e., that amount which, when administered to a 
mammal in need thereof, is sufficient to effect treatment, as described above. 

15 Administration of the synthetic bioactive proteins described herein can be via any of 
the accepted modes of administration for agents that serve similar utilities. As used 
herein, the terms "polymer-modified synthetic bioactive proteins of this invention", 
"pharmaceutically acceptable salts of the polypeptides of the invention" and "active 
ingredient" are used interchangeably. 

20 The level of the polymer-modified synthetic bioactive proteins in a 

formulation can vary within the full range employed by those skilled in the art, e.g., 
from about 0.01 percent weight (%w) to about 99.99%w of the protein antagonist or 
agonist based on the total formulation and about 0.01 %w to 99.99%w excipient. 
More typically, the polymer-modified synthetic bioactive proteins will be present at 

25 a level of about 0.5%w to about 80%w. 

While human dosage levels have yet to be optimized for the polymer- 
modified synthetic bioactive proteins of the invention, generally, the amount of 
compound administered will, of course, be dependent on the subject and the disease 
state targeted for prevention or alleviation, the nature or severity of the affliction, the 

30 manner and schedule of administration and the judgment of the prescribing 
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physician. Such use optimization is well within the ambit of those of ordinary skill 
in the art. 

Administration can be via any accepted systemic or local route, for example, 
via parenteral, oral (particularly for infant formulations), intravenous, nasal, 
5 bronchial inhalation (i.e., aerosol formulation), transdermal or topical routes, in the 
form of solid, semi-solid or liquid or aerosol dosage forms, such as, for example, 
tablets, pills, capsules, powders, liquids, solutions, emulsion, injectables, 
suspensions, suppositories, aerosols or the like. The polymer-modified synthetic 
bioactive proteins of the invention can also be administered in sustained or 

10 controlled release dosage forms, including depot injections, osmotic pumps, pills, 
transdermal (including electrotransport) patches, and the like, for the prolonged 
administration of the polypeptide at a predetermined rate, preferably in unit dosage 
forms suitable for single administration of precise dosages. The compositions will 
include a conventional pharmaceutical carrier or excipient and a protein antagonist 

15 or agonist of the invention and, in addition, may include other medicinal agents, 
pharmaceutical agents, carriers, adjuvants, etc. Carriers can be selected from the 
various oils, including those of petroleum, animal, vegetable or synthetic origin, for 
example, peanut oil, soybean oil, mineral oil, sesame oil, and the like. Water, saline, 
aqueous dextrose, and glycols are preferred liquid carriers, particularly for injectable 

20 solutions. Suitable pharmaceutical carriers include starch, cellulose, talc, glucose, 
lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, magnesium stearate, 
sodium stearate, glycerol monostearate, sodium chloride, dried skim milk, glycerol, 
propylene glycol, water, ethanol, and the like. Other suitable pharmaceutical carriers 
and their formulations are described in "Remington's Pharmaceutical Sciences" by 

25 E. W. Martin. 

If desired, the pharmaceutical composition to be administered may also 
contain minor amounts of non-toxic auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents and the like, such as for example, sodium 
acetate, sorbitan monolaurate, triethanolamine oleate, etc. 

30 Although more of the active ingredient may be required, oral administration 

can be used to deliver the polymer-modified synthetic bioactive proteins of the 
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invention using a convenient daily dosage regimen, which can be adjusted according 
to the degree of prevention desired or in the alleviation of the affliction. For such 
oral administration, a pharmaceutically acceptable, non-toxic composition is formed 
by the incorporation of any of the normally employed excipients, such as, for 
5 example, pharmaceutical grades of mannitol, lactose, starch, povidone, magnesium 
stearate, sodium saccharine, talcum, cellulose, croscarmellose sodium, glucose, 
gelatin, sucrose, magnesium carbonate, and the like. Such compositions take the 
form of solutions, suspensions, dispersible tablets, pills, capsules, powders, 
sustained release formulations and the like. Oral formulations are particularly suited 

10 for treatment of gastrointestinal disorders. Oral bioavailablity for general systemic 
purposes can be adjusted by utilizing excipients that improve uptake to systemic 
circulation, such as formulation comprising acetylated amino acids. See, e.g., US 
5,935,601 and US 5,629,020. 

The compositions may take the form of a capsule, pill or tablet and thus the 

15 composition will contain, along with the active ingredient, a diluent such as lactose, 
sucrose, dicalcium phosphate, and the like; a disintegrant such as croscarmellose 
sodium, starch or derivatives thereof; a lubricant such as magnesium stearate and the 
like; and a binder such as a starch, polyvinylpyrrolidone, gum acacia, gelatin, 
cellulose and derivatives thereof, and the like. 

20 Liquid pharmaceutically administrable compositions can, for example, be 

prepared by dissolving, dispersing, etc. a polymer-modified synthetic bioactive 
proteins of the invention (about 0.5% to about 20%) and optional pharmaceutical 
adjuvants in a carrier, such as, for example, water, saline, aqueous dextrose, 
glycerol, glycols, ethanol, preservatives and the like, to thereby form a solution or 

25 suspension. If desired, the pharmaceutical composition to be administered may also 
contain minor amounts of nontoxic auxiliary substances such as wetting agents, 
suspending agents, emulsifying agents, or solubilizing agents, pH buffering agents 
and the like, for example, sodium acetate, sodium citrate, cyclodextrine derivatives, 
polyoxyethylene, sorbitan monolaurate or stearate, etc. Actual methods of preparing 

30 such dosage forms are known, or will be apparent, to those skilled in this art; for 
example, see Remington's Pharmaceutical Sciences, Mack Publishing Company, 
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Easton, Pennsylvania. The composition or formulation to be administered will, in 
any event, contain a quantity of the active ingredient in an amount effective to 
prevent or alleviate the symptoms of the subject being treated. For oral 
administration to infants, a liquid formulation (such as a syrup or suspension) is 
5 preferred. 

For a solid dosage form containing liquid, the solution or suspension, in for 
example propylene carbonate, vegetable oils or triglycerides, is preferably 
encapsulated in a gelatin capsule. For a liquid dosage form, the solution, e.g. in a 
polyethylene glycol, may be diluted with a sufficient quantity of a pharmaceutically 

10 acceptable liquid carrier, e.g. water, to be easily measured for administration. 

Alternatively, liquid or semi-solid oral formulations may be prepared by 
dissolving or dispersing the active ingredient in vegetable oils, glycols, triglycerides, 
propylene glycol esters (e.g. propylene carbonate) and the like, and encapsulating 
these solutions or suspensions in hard or soft gelatin capsule shells. 

15 In applying the polymer-modified synthetic bioactive proteins of this 

invention to treatment of the above conditions, administration of the active 
ingredients described herein are preferably administered parenterally. Parenteral 
administration is generally characterized by injection, either subcutaneously, 
intramuscularly or intravenously, and can include intradermal or intraperitoneal 

20 injections as well as intrasternal injection or infusion techniques. Injectables can be 
prepared in conventional forms, either as liquid solutions or suspensions, solid forms 
suitable for solution or suspension in liquid prior to injection, as emulsions or in 
biocompatible polymer-based microspheres (e.g., liposomes, polyethylene glycol 
derivatives, poly(D,C)lactide and the like). Suitable excipients are, for example, 

25 water, saline, dextrose, glycerol, ethanol or the like. In addition, if desired, the 
pharmaceutical compositions to be administered may also contain minor amounts of 
non-toxic auxiliary substances such as wetting or emulsifying agents, pH buffering 
agents, solubility enhancers, protein carriers and the like, such as for example, 
sodium acetate, polyoxyethylene, sorbitan monolaurate, triethanolamine oleate, 

30 cyclodextrins, serum albumin etc. 
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The polymer-modified synthetic bioactive proteins of the present invention 
can be administered parenterally, for example, by dissolving the compound in a 
suitable solvent (such as water or saline) or incorporation in a liposomal formulation 
followed, by dispersal into an acceptable infusion fluid. A typical daily dose of a 
5 polypeptide of the invention can be administered by one infusion, or by a series of 
infusions spaced over periodic intervals. For parenteral administration there are 
especially suitable aqueous solutions of an active ingredient in water-soluble form, 
for example in the form of a water-soluble salt, or aqueous injection suspensions that 
contain viscosity-increasing substances, for example sodium 

10 carboxymethylcellulose, sorbitol and/or dextran, and, if desired, stabilizers. The 
active ingredient, optionally together with excipients, can also be in the form of a 
lyophilisate and can be made into a solution prior to parenteral administration by the 
addition of suitable solvents. 

A more recently devised approach for parenteral administration employs the 

15 implantation of a slow-release or sustained-release system, such that a constant level 
of dosage is maintained. See, e.g., US 3,710,795, US 5,714,166 and US 5,041,292, 
which are hereby incorporated by reference. 

The percentage of the active ingredient contained in such parental 
compositions is highly dependent on the specific nature thereof, as well as the 

20 activity of the polypeptide and the needs of the subject. However, percentages of 
active ingredient of 0.01% to 10% in solution are employable, and will be higher if 
the composition is a solid which will be subsequently diluted to the above 
percentages. Preferably the composition will comprise 0.02-8% of the active 
ingredient in solution. 

25 Another method of administering the polymer-modified synthetic bioactive 

proteins of the invention utilizes both a bolus injection and a continuous infusion. 

Aerosol administration is an effective means for delivering the polymer- 
modified synthetic bioactive proteins of the invention directly to the respiratory tract. 
Some of the advantages of this method are: 1) it circumvents the effects of 

30 enzymatic degradation, poor absorption from the gastrointestinal tract, or loss of the 
therapeutic agent due to the hepatic first-pass effect; 2) it administers active 
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ingredients which would otherwise fail to reach their target sites in the respiratory 
tract due to their molecular size, charge or affinity to extra-pulmonary sites; 3) it 
provides for fast absorption into the body via the alveoli of the lungs; and 4) it 
avoids exposing other organ systems to the active ingredient, which is important 
5 where exposure might cause undesirable side effects. For these reasons, aerosol 
administration is particularly advantageous for treatment of asthma, local infections 
of the lung, and other diseases or disease conditions of the lung and respiratory tract. 

There are three types of pharmaceutical inhalation devices, nebulizers 
inhalers, metered-dose inhalers and dry powder inhalers. Nebulizer devices produce 

10 a stream of high velocity air that causes the protein derivative (which has been 
formulated in a liquid form) to spray as a mist that is carried into the patient's 
respiratory tract. Metered-dose inhalers typically have the formulation packaged 
with a compressed gas and, upon actuation, discharge a measured amount of the 
polypeptide by compressed gas, thus affording a reliable method of administering a 

15 set amount of agent. Dry powder inhalers administer the polypeptide in the form of 
a free flowing powder that can be dispersed in the patient's air-stream during 
breathing by the device. In order to achieve a free flowing powder, the protein 
derivative is formulated with an excipient, such as lactose. A measured amount of 
the protein derivative is stored in a capsule form and is dispensed to the patient with 

20 each actuation. All of the above methods can be used for administering the present 
invention. 

Pharmaceutical formulations based on liposomes are also suitable for use 
with the polymer-modified synthetic bioactive proteins of this invention. See, e.g., 
US Patents Nos. 5,631,018, US 5,723,147, and 5,766,627. The benefits of 

25 liposomes are believed to be related to favorable changes in tissue distribution and 
pharmacokinetic parameters that result from liposome entrapment of drugs, and may 
be applied to the polypeptides of the present invention by those skilled in the art. 
Controlled release liposomal liquid pharmaceutical formulations for injection or oral 
administration can also be used. 

30 For systemic administration via suppository, traditional binders and carriers 

include, for example, polyethylene glycols or triglycerides, for example PEG 1000 
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(96%) and PEG 4000 (4%). Such suppositories may be formed from mixtures 
containing the active ingredient in the range of from about 0.5 w/w% to about 10 
w/w%; preferably from about 1 w/w% to about 2 w/w%. 

All publications and patent applications mentioned in this specification are 
5 herein incorporated by reference to the same extent as if each individual publication 
or patent application was specifically and individually indicated to be incorporated 
by reference. The discussion of the background to the invention herein is included 
to explain the context of the invention. This is not to be taken as an admission that 
any of the material referred to was published, known, or part of the prior art or 
1 0 common general knowledge anywhere in the world as of the pri ority date of any of 
the claims. Having now generally described the invention, the same will be more 
readily understood through reference to the following examples, which are provided 
by way of illustration, and are not intended to be limiting of the present invention, 
unless specified. 

15 EXAMPLES 

Abbreviations 

Acm = acetamidomethyl thiol-protecting group [i.e. -CH 2 NHCOCH 3 ] 
AoA = aminooxyacetyl 

Arg(Tos) = L-arginine(side chain N G toluenesulfonyl-protected) 
ART = absolute reticulocyte count 

Asp(cHex) = L-aspartic acid(side chain cyclohexyl ester-protected) 
AUC = area under the curve 
Boc = tert.butoxycarbonyl 
CD = circular dichroism 
GDI = carbonyldiimidiazole 
CHO = Chinese hamster ovary 
CL = clearance (mL/hr/kg) 
Cmax = maximum concentration 

Cys(4MeBzl) = L-cysteine(side chain (4-methyl)benzyl-protected) 



20 



25 
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Cys(Acm) = L-cysteine(side chain acetamidomethyl [i.e. -CH 2 NHCOCH 3 ]- 
protected) 

DCM = dichloromethane 
DIC = diisopropylcarbodiimide 
5 DIEA = diisopropylethylamine 
DMF = dimethylformamide 
DMSO = dimethylsulfoxide 
DPC = dodecylphosphocholine 
Dpr = L-l,2diaminopropionic acid 
10 ED50 = effective dose required to reach 50% maximum effect 
EDA = (4,7,10)-trioxatridecane-l,13diamine 
ELISA = enzyme-linked immunoassay 
EPO = erythropoietin 

ES-MS = electrospray ionization mass spectrometry 

15 FBS = fetal bovine serum 

Glu(cHex) = L-glutamic acid(side chain cyclohexyl ester-protected) 
GM-CSF = granulocyte-macrophage colony stimulating factor 
HATU = 0-(7-azabenzotriazol-l-yl)-],l,3,3-tetramethylammonium 
hexafiuorophosphate 

20 HCT = hematocrit 
HGB = hemoglobin 

His(Dnp) = L-histidine(side chain N lm dinitrophenyl -protected) 
HOBT = N-hydroxybenzotriazoie 
HPLC = high pressure liquid chromatography 
25 IMDM = Iscove's modified Dulbecco's medium 
IPA = isopropanol 
Lev = levulinic acid 

Lys(ClZ) = L-lysine(side chain 2-chlorobenzyloxycarbonyl)-protected 
MBHA = 4-methylbenzhydrylamine 
3 0 MRT = mean residence time 
MTT = 4-methylTrityl 
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MTT = thiazolyl blue 

NHS = N-hydroxysuccinimide 

-OCH 2 -Pam-resin = -0-CH 2 -Bz-CH 2 CONHCH 2 (copolystyrene- 
divinylbenzene)-resin 
5 Pbo = 4-(CH 3 S(0)-)benzyl 

PBS = phosphate buffered saline 

PCV = packed cell volume 

RBC = red blood cell 

RET = reticulocyte 
I rhEPO = recombinant human EPO 

RSA = rat serum albumin 

SDS = sodium docyl sulfate 

SDS-PAGE = SDS-polyacrylamide gel electrophoresis 
Ser(Bzl) = L-serine(side chain benzyl-protected) 

Single letter code for amino acids: A = alanine; C = cysteine; D = aspartic acid; E = 
glutamic acid; F = phenylalanine; G = glycine; H = histidine; I = isolecuine; K = 
lysine; L = leucine; M = methionine; N = asparagine; P = proline; Q = glutamine; R 
= arginine; S = serine; T = threonine; V = valine; W = tryptophan; Y = tyrosine 
SPPS = stepwise solid phase peptide synthesis 
Succ = succinly [i.e. -COCH 2 CH 2 CO-] 
TCEP = tris(2-carboxyethyl)phosphine hydrochloride 
TFE = 2,2,2-trifluoroethanol 

Thr(Bzyl) = L-threonine(side chain benzyl-protected) 

Three letter code for the amino acids: Ala = alanine; Arg = arginine; 

Asn = asparagine; Asp = aspartic acid; Chg = cyclohexylglycine; Cys = 
cysteine; Gin = glutamine; Glu = glutamic acid; Gly = glycine; His = 
histidine; He = isoleucine; Leu = leucine; Lys = lysine; Met = methionine; 
Phe = phenylalanine; Pro = proline; Ser = serine; Thr = threonine; Thz = 4- 
thioproline; Trp = tryptophan; Tyr = tyrosine; Val - valine 

Trp(formyl) = L-tryptophan(side chain N In formyI-protected) 

TTD = (4,7,10)-trioxatridecane-l,13diamine 
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Tyr(BrZ) - L-tyrosine(side chain 2-bromobenyzloxycarbonyl-protected) 
Vc = volume of distribution of the central compartment 

Example 1 

Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-0 

5 A synthetic eiythropoiesis stimulating protein (SEP) was synthesized. The 

sequence of the full-length synthesized protein (designated "SEP-0 (1-166)" is: 

APPRLICDSR VLERYLLEAK EAEKITTGCA EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWyP LQLHVDKAVS GLRSLTTLLR ALGAQKu/AIS 

10 PPDAASAAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 

CRTGDR ( SEQ ID NO:l) 

where * denotes a non-native amino acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group. The SEP-0 protein was 
synthesized in solution from four polypeptide segments: 

15 Segment SEP-0:l (GRFN 1711; composed of residues 1-32 of SEQ ID NO:l): 

APPRLICDSR VLERYLLEAK EAEKITTGCA EH-thioester 

Segment SEP-0:2 (GRFN 1712, composed of residues 33-88 of SEQ ID NO:l): 

CSLNEKITVP DTKVNFYAWK RMEVGQQAVE VWQGLALLSE 
AVLRGQALLV KSSQPW-thioester (where Cys 33 is Acm protected) 

Segment SEP-0;3 (GRFN 1713, composed of residues 89-1 1 6 of SEQ ID NO:l): 
CPLQLHVDKA VSGLRSLTTL LRALGAQK-thioester (where Cys 89 is 
Acm protected) 

Segment SEP-0:4 (GRFN 1714, composed of residues 1 17-166 of SEQ ID NO:!)- 

CAISPPDAAS AAPLRTITAD TFRKLFRVYS NFLRGKLKLY 



20 
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TGEACRTGDR-carboxylate (where the C-terminal cysteine (Cys 161 ) 
carries a picolyl (pico) protecting group) 

The peptides SEP-0:1 and SEP-0:2 and SEP-0:3 were synthesized on a 
thioester-generating resin by the in situ neutralization protocol for Boc (tert- 
5 butoxycarbonyl) chemistry and stepwise solid phase peptide synthesis (SPPS) using 
established SPPS, side-chain protection and thioester-resin strategies (Hackeng, et 
al., PNAS (1999) 96: 10068-10073; and Schnolzer, et al. Int. J. Pept. Prot. Res., 
(1992) 40: 180-193)) on an ABI433A automated peptide synthesizer or by manual 
chain assembly, or ordered and acquired from commercial vendors. For instance, a 

10 standard set of Boc SPPS protecting groups was used, namely: Arg(Tos); 
Asp(cHex); Cys(4MeBzl) & Cys(Acm); Glu(cHex); His(DNP); Lys(ClZ); Ser(Bzl); 
Thr(Bzl); Trp(formyl); Tyr(BrZ); Met, Asn, Gin were side-chain unprotected. 
Segment SEP-0:4 was synthesized analogously on a -OCH 2 -Pam-resin. The 
peptides were deprotccted and simultaneously cleaved from the resin support using 

15 HF/p-cresol according to standard Boc chemistry procedure; however, for those 
peptides containing protecting groups not removed in HF/p-cresol, the protecting 
groups were retained. The peptides were purified by preparative C4 reverse-phase- 
high pressure liquid chromatography (HPLC). Fractions containing pure peptide 
were identified using ES-MS (electrospray ionization mass spectrometry), pooled 

20 and lyophilized for subsequent ligation. 

Step 1: Ligation #7 Segment SEP-0:4 and segment SEP-0:3 were dissolved 
in TFE at 15 mM concentration. Saturated phosphate buffer (pH 7.9) containing 6 
M guanidinium chloride and 1% thiophenol was added, resulting in a clear solution 
of the peptide segments. After ligation, the ligation mix was added to a solution of 2 
25 ml TFE (trifluoroethanol), 6 ml 6M guanidinium chloride, 100 mM phosphate 
containing 25% p-mercaptoethanol and incubated for 20 minutes. The solution was 
acidified with a solution of 15 mg/ml TCEP (tris(2-carboxyethyl)phosphine.HCl) in 
glacial acetic acid and loaded onto a preparative C4 reverse-phase HPLC column (1 
inch diameter). The peptides were then purified by preparative gradient reverse- 
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phase HPLC. Fractions containing the desired ligated product SEP-OAcm+SEP- 
0:3+SEP-0:4 were identified by ES-MS and pooled. 

Step 2: Acm-removal #1 For Acm removai, the aqueous acetonitrile 
solution containing the pooled fractions of SEP-0:Acm+SEP-0:3+SEP-0:4 was 
5 diluted lx with HPLC grade water, and solid urea was added for a final 
concentration of 2 molar. A threefold molar excess (relative to the total expected 
cysteine concentration) of a 30 mg/ml Hg(acetate) 2 solution in 3% aqueous acetic 
acid was added and the solution is stirred for one hour. The solution was then made 
20% in P-mercaptoethanol, loaded onto a semi-preparative reverse-phase HPLC 
10 column and purified with a step gradient. Fractions containing the desired product 
SEP-0:3+SEP-0:4 were identified by ES-MS and lyophilized overnight. 

Step 3: Ligation #2 Equal amounts of SEP-0:3+SEP-0:4 and SEP-0:2 were 
jointly dissolved in neat TFE at 15 mM concentration. 250 mM Phosphate buffer 
(pH 7.5) containing 6 M guanidinium chloride and 1% thiophenol was added, 

15 resulting in a clear solution of the peptide segments. After one day of ligation, the 
ligation mix was added to a solution of 10 ml TFE, 10 ml p-mercaptoethanol, 10 ml 
piperidine and 20 ml 6M guanidinium chloride, pH 4, and incubated for 20 minutes 
to remove any remaining protecting groups. The solution was acidified with a 
solution of 15 mg/ml TCEP in 20% aqueous acetic acid, loaded onto a preparative 

20 reverse-phase HPLC column and purified with a linear gradient. Fractions 
containing the desired ligated product SEP-0 :Acm+SEP-0:2+SEP-0:3+SEP-0:4 were 
identified by ES-MS and lyophilized overnight. 

Step 4: Carboxymethylation SEP-0:Acm+SEP-0:2+SEP-0:3+SEP-0:4 was 
dissolved in TFE at 15 mM concentration. A two-fold excess (v/v) of 200 mM 
25 Phosphate buffer (pH 7.9) containing 6 M guanidinium chloride was added, resulting 
in a clear solution of the peptide segment. A 25-fold excess of bromo-acetic acid 
dissolved in a minimum amount of methanol was added, and the solution was 
allowed to react for two hours. The solution was acidified with a solution of 15 
mg/ml TCEP in 20% aqueous acetic acid, loaded onto a preparative reverse-phase 
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HPLC column and purified with a step gradient. Fractions containing the desired 
carboymethylated product SEP-0:Acm+SEP-0:2+SEP-0:3+SEP-0:4+Et were 
identified by ES-MS and pooled. 

Step 5; Picolyl Removal Zinc dust was activated in 2M HC1 for 30 minutes. 
5 Peptide SEP-0:Acm+SEP-0:2+SEP-0:3+SEP-0:4+Et was dissolved in neat TFE at 
about 10 mg/ml concentration. The solution was diluted with 4x (v/v relative to 
TFE) 6M guanidinium chloride, 100 mM acetate, pH 4, containing (freshly added) 
35 mg/ml L-methionine and 35 mg/ml dodecylsarcosine (i.e. sodium N- 
dodecanoylsarcosine). The solution was added to the activated Zn powder. The 

10 reaction was monitored at ~lhr intervals and is complete after five hours. After 
completion, the supernatant was removed and the remaining Zn powder washed 
twice for five minutes with 6M guanidinium chloride, pH 4, 100 mM acetate 
containing 35 mg/ml L-methionine and 35 mg/ml dodecylsarcosine containing 20% 
TFE as well as once with the same solution containing 20% (3-mercaptoethanol. The 

15 combined product was acidified with a solution of 15 mg/ml TCEP in 20% aqueous 
acetic acid, loaded onto a preparative reverse-phase HPLC column and purified with 
a step gradient. Fractions containing the desired modified product SEP- 
0:Acm+SEP-0:2+SEP-0:3+SEP-0;4+Et-Pico were identified by ES-MS and pooled. 

Step 6: Acm-removal #2 The pooled solution of SEP-OAcm+SEP- 
20 0:2+SEP-0:3+SEP-0:4+Et-Pico was diluted 3x with HPLC grade water, and solid 
urea was added for a final concentration of 2 molar. A threefold molar excess 
(relative to the total expected cysteine concentration) of a 30 mg/ml Hg(acetate) 2 
solution in 3% aqueous acetic acid was added and the solution stirred for one hour. 
The solution was then made 20% in p-mercaptoethanol, loaded onto a semi- 
25 preparative reverse-phase HPLC column and purified with a step gradient. Fractions 
containing the desired product SEP-0:2+SEP-0:3+SEP-0:4+Et-Pico were identified 
by ES-MS, diluted 2x (v/v) with water containing 2x (w/w relative to peptide mass) 
DPC (dodecylphosphocholine) and iyophilized overnight. 
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Step 7: Ligation #3 Equal amounts of SEP-0:2+SEP-0:3+SEP-0:4+Et-Pico 
and SEP-0:1 were jointly dissolved in neat TFE at 15 mM concentration and 250 
mM Phosphate buffer (pH 7.5) containing 6 M guanidinium chloride is added. To 
the solution 1% thiophenol was added. After one day of ligation, the ligation mix 
5 was added to a solution of 10 ml TFE, 10 ml P-mercaptoethanol, 10 ml piperidine 
and 20 ml 6M guanidinium chloride, pH 4, and incubated for 20 minutes to remove 
any remaining protecting groups. The solution was acidified with a solution of 15 
mg/ml TCEP in 20% aqueous acetic acid, loaded onto a preparative reverse-phase 
HPLC column and purified with a linear gradient. Fractions containing the desired 
10 ligated product SEP-0 (1-166): (SEQ ID NO:l) were identified by electrospray mass 
spectrometry, diluted 2x (v/v) with water containing 2x (w/w relative to peptide 
mass) dodecylsarcosine and lyophilized. 

Step 8 Folding: Full-length ligated peptide SEP-0 (1-166) was dissolved in 
200 mM Tris buffer (pH 8.7) containing 6 M guanidinium chloride and 20% TFE 

15 and a ten-fold molar excess (relative to Cys residues in protein) of cysteine. This 
solution was dialyzed overnight against a solution of 200 mM Tris buffer (pH 8.7) 
containing 3 M guanidinium chloride at room temperature. The solution was then 
dialyzed against a solution of 200 mM Tris buffer (pH 8.7) containing 1 M 
guanidinium chloride at room temperature for 4 hours at 4 °C and finally against 10 

20 mM phosphate buffer (pH 7.0) for 4 hours at 4°C to yield the final folded product. 
Folding was verified by electrospray ES-MS and CD (circular dichroism) 
spectrometry. 

Step 9 Purification: The folded polypeptide was concentrated 5x in 
centricon concentrator vials and loaded on to Resource S cation exchange column 
25 equilibrated at 10 mM phosphate, pH 7.0. The folded protein was eluted in a linear 
salt gradient to 500 mM NaCl in 10 minutes. Fractions containing the desired folded 
product SEP-0 (1-166) were identified by SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE), and frozen and stored at - 80°C. An analytical reverse-phase HPLC 
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chromatogram and an ES-MS spectrum of the folded protein product as well as a CD 
spectrum demonstrated the presence of folded protein. 

Example 2 

Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1-L30 

5 A second synthetic erythropoiesis stimulating protein (designated SEP- 1 - 

L30) was synthesized to contain oxime-forming groups at positions 24 and 126 of 
SEP-0. These groups were then used to form SEP-1-L30, in which linear (EDA- 
Succ-)i8 carboxylate (EDA = (4,7,10)-trioxatridecane-l,13diamine, also called 
TTD; Succ = -CO-CH 2 CH 2 CO-) polymers have been joined to the protein backbone. 
10 The sequence of the full-length SEP-1 (1-166) is: 

APPRLICDSR VLERYLLEAK EAEK ox ITTGCA EHCSLNEKIT 

VPDTKVN FYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWy/P LQLHVDKAVS GLRSLTTLLR ALGAQKl|/AIS 

PPDAAK ox AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
15 CRTGDR (SEQ ID NO: 2) 

where ¥ denotes an non-native amino acid residue consisting of a cysteine that is 
carboxymethylated at the sulfhydryl group, and where K ox denotes a non-native 
lysine that is chemically modified at the e-amino group with an oxime linker group 
coupled to a designated water-soluble polymer through an oxime bond. The 
20 structure of SEP-1-L30 also is shown in Figure 17. 

A schematic illustrating the chemical synthesis of SEP1-L30 described below 
is shown in Figure 16. Briefly, in Figures 16A and 16B, water-soluble polymer 
GRFN32 (GP32) is attached to peptides SEP-1 :1 and SEP-1 :4 through an oxime 
bond-forming chemical ligation reaction. As shown, peptide SEP-1 :1 bears a C- 
25 terminal Yaa group at position 32 comprising a histidine alpha-carboxyl thioester for 
subsequent native chemical ligation, and a non-native lysine residue at position 24 (a 
glycosylation site of natural human EPO) the side chain of which has been 
chemically modified to bear a U n =i group comprising an aminooxy acyl moiety. The 
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N-terminal alanine corresponding to position 1 of the final full-length product is 
shown for reference. Peptide SEP-1:4 has an unprotected N-terminal Xaa group at 
position 117 comprising a cysteine, a U„=i group comprising an aminooxy acyl 
modified Lysine at position 126 (a glycosylation site of natural human EPO), and 
5 cysteine at position 161 (a disulfide forming cysteine) that has its side-chain thiol 
protected with a picolyl group. Cys 161 is protected to prevent its thioalkylation in a 
subsequent conversion of cysteines introduced for native chemical ligation sites (See 
Figure 16D); also, a picolyl protecting group is used since its removal is orthogonal 
to the conditions required for removing an Acm (acetamidomethyl) protected 

10 cysteine in a first native chemical ligation reaction (See Figure 16C). Site-specific, 
and exclusive attachment of the GP32 polymer constructs at positions 24 and 126 is 
achieved through oxime-forming chemical ligation to produce the precision 
polymer-modified peptides SEP-1:1+GP32 and SEP-1:4+GP32. Figure 16C shows 
native chemical ligation of SEP-1:4+GP32 to a middle peptide segment SEP- 1:3 and 

15 generation of SEP-1:3-SEP+1:4+GP32, the ligation site of which is at Lys 116 and 
Cys' 17 . As shown peptide SEP-1:3 comprises an N-terminal Xaa group at position 
89 comprising an Acm protected cysteine, and a C-terminal Yaa group at position 
116 comprising a lysine alpha-carboxyl thioester. Following native chemical 
ligation, the Acm protecting group is removed to prepare this ligation product for the 

20 next ligation reaction. Figure 16D shows native chemical ligation of SEP-1:3+SEP- 
1:4+GP32 to a middle peptide segment SEP- 1:2 and generation of SEP-1:2+SEP- 
1:3+SEP-1:4+GP32, the ligation site of which is at Tip 88 and Cys 89 . As shown 
peptide SEP- 1:2 comprises an N-terminal Xaa group at position 33 comprising an 
Acm cysteine, and a C-terminal Yaa group at position 89 comprising a tryptophan 

25 alpha-carboxyl thioester. Following native chemical ligation, the ligation product is 
exposed to bromoacetic acid for carboxymethylation of the side chain thiols of 
ligation site cysteines 89 and 117, and thus their conversion to pseudo glutamic 
acids. Following carboxymethylation, the Acm and Picolyl protecting groups are 
removed to prepare this unprotected, polymer-modified ligation product for the next 

30 ligation reaction. Figure 16E shows native chemical ligation of SEP- 1:3- 
SEP1:4+GP32 to peptide segment SEP-1:1+GP32 (corresponding the N-terminal 
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segment of the full length product) and generation of the full length SEP-1-L30 
product SEP-1:1+GP32+SEP-1:2+SEP-1:3+SEP-1:4+GP32, the final ligation site of 
which is at His 32 and Cys 33 . As shown the full-length SEP-1-L30 product comprises 
two water-soluble polymers (G32) attached exclusively at user-defined sites, i.e., 
5 glycosylation sites corresponding to position 24 and position 126 of natural human 
EPO. The details of the synthesis are described below. 

A. Oximation of GRFN1776 and GRFN1711 with GRFNP32 

Oxime formation was performed to attach water-soluble polymers bearing an 
aminooxyacetyl group to peptides carrying a ketone carbonyl group. To accomplish 
10 this, the following peptide segments were synthesized: 

Segment SEP-1:4 (GRFN 1776; composed of residues 1 17-166 of SEQ ID NO:2): 
CAISPPDAAK AAPLRTITAD TFRKLFRVYS NFLRGKLKLY 
TGEACRTGDR-carboxylate (where Lys 126 is modified with a levulinic 
acid residue at the e-amino group, and where Cys 117 is Acm protected) 

15 Segment SEP-1:1 (GRFN 1711, composed of residues 1-32 of SEQ ID NO:2): 

APPRLICDSR VLERYLLEAK EAEKITTGCA EH-thioester 
(where Lys24 is modified with a levulinic acid residue) 

Segment SEP-1:1 (GRFN 1711) was synthesized on a thioester-generating 
resin, and Segment SEP- 1:4 (GRFN 1776) on a -OCH 2 -Pam-resin as in Example 1. 

20 Lysines 24 and 126 of these two peptide segments were initially protected with an 
Fmoc group at the e-amino group. After completion of the chain assembly, the 
Fmoc-bearing amino groups were deprotected following standard Fmoc deprotection 
procedures and modified by attachment of levulinic acid to each peptide resin, 
respectively. The peptides were then deprotected and simultaneously cleaved from 

25 the resin support as described in Example 1. The peptides were purified by 
preparative C4 reverse phase HPLC. Fractions containing pure peptide were 
identified using ES-MS, pooled and lyophilized for subsequent ligation. GRFNP32 
[-(EDA-Succ) ls carboxylate] was assembled on a Sasrin carboxyl-generating resin 
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following standard protocols (Rose, K. et al, U.S. Patent Application Serial No. 
09/379,297; Rose, et al., J Am Chem Soc.(1999) 121: 7034). An aminooxyacetyl 
(AoA) moiety was attached to the N-terminal amino group of the polymer by 
coupling a fivefold excess of activated Boc-aminooxyacetic acid. The polymer 
chain was cleaved from the resin support using classic Fmoc-chemistry procedures. 
The polymer chain was purified by preparative C4 reverse-phase HPLC. Fractions 
containing pure polymer were identified using ES-MS, pooled and lyophilized for 
subsequent ligation. 

Segment SEP- 1:4 and GRFNP32 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA . The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unmodified peptide and unrcacted polymer by preparative gradient 
C4 reverse-phase HPLC. Fractions containing the desired oximated product SEP- 
1 :4 + GP32 were identified by ES-MS and pooled and lyophilized. 

Segment SEP-1:1 and GRFNP32 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA. The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unreacted polymer by preparative gradient C4 reverse-phase HPLC. 
Fractions containing the desired oximated product SEP-L1 + GP32 were identified 
by ES-MS and pooled and lyophilized. 

B. Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1- 
L30 

SEP-1-L30 was synthesized in solution from four polypeptide segments: 

Segment SEP-1:1+GP32 (GRFN 1711 + GRFNP32, corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK EAEK ox ITTGCA EH-thioester (where Lys 24 
is modified at the s-amino group with a levulinic oxime linker group that is 
coupled to GRFNP32 through a levulinic-aminooxyacetyl (Lev-AoA) oxime 
bond denoted K ox ) 
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Segment SEP- 1:2 (GRFN 1712, corresponding to residues 33-88 ofSEQ ID NO:2): 

CSLNEKIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 

S E AVLRGQAL LVKS S QPW-thioester (where Cys 33 is Acm protected) 

Segment SEP- 1:3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys 89 is 
Acm protected) 

Segment SEP:1:4+GP32 (GRFN 1776 + GRFNP32, corresponding to residues 117- 

166 of SEQ ID NO:2): 

CAIS PPDAAK° X AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR-carboxylate (where Lys' 26 is modified at the s-amino group with a 
levulinic oxime linker group that is coupled to GRFNP32 through a 
levulinic-aminooxycetyl (Lev-AoA) oxime bond denoted K° K and where the 
C-terminal cysteine carries a picolyl (pico) protecting group) 

Synthesis of additional peptides, ligation reactions, carboxymethylation, 
protecting group removal reactions, folding and purification were performed as 
described in Example 1 to yield full-length, folded SEP-1-L30. An analytical 
reverse-phase HPLC chromatogram and an ES-MS spectrum of the folded protein 
product as well as a CD spectrum demonstrated the presence of folded protein 

Example 3 

Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1-L26 

A third synthetic erythropoiesis stimulating protein (designated SEP-1-L26) 
was synthesized to contain oxime-forming groups at positions 24 and 126 of SEP-0. 
These groups were then used to form SEP-1-L26, in which the linear polymers 
(EDA-Succ)is carboxylate and (EDA-Succ)g-amide have been joined to the protein 
backbone through oxime linkages at positions 24 and 126, respectively. The 
sequence of the full-length SEP-1 (1-166) is: 
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APPRLICDSR VLERYLLEAK E AEK ox I TT GC A EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWyP LQLHVDKAVS GLRSLTTLLR ALGAQKv|/AIS 

P P D A AK ox AA P L RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR (SEQIDNO:2) 

where ¥ denotes an non-native amino acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group, and where K ox denotes a non- 
native lysine that is chemically modified at the e-amino group with an oxime linker 
group coupled to a designated water-soluble polymer through an oxime bond. 

In contrast to SEP-1-L30, the SEP-1-L26 construct was designed to bear a 
smaller and uncharged water-soluble polymer attached at position 126. The polymer 
attached at position 24 was the same as in SEP-1-L30. Assembly of the full-length 
product was as described in Example 2. The structure of SEP-1-L26 is shown in 
Figure 18. 

A. Oximation of GRFN1776 with GRFNP6 and oximation of 
GRFN1711 with GRFNP32 

Oxime formation was performed to attach water-soluble polymers bearing an 
aminooxyacetyl group to peptides carrying a ketone carbonyl group. To accomplish 
this, the following peptide segments were synthesized: 

Segment SEP-1:4 (GRFN 1776; composed of residues 1 17-166 of SEQ ID NO:2): 
CAISPPDAAK AAPLRTITAD TFRKLFRVYS NFLRGKLKLY 
TGEACRTGDR-carboxylate (where Lys 125 is modified with a levulinic 
acid residue at the s-amino group, and where Cys 1 17 is Acm protected) 



Segment SEP-1:1 (GRFN 1711, composed ofresidues 1-32 of SEQ ID NO:2): 

APPRLICDSR VLERYLLEAK EAEKITTGCA EH-thioester 
(where Lys 24 is modified with a levulinic acid residue) 
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Segment SEP-1:1 (GRFN 1711) was synthesized on a thioester-generating 
resin, and Segment SEP-1:4 (GRFN 1776) on a -OCH 2 -Pam-resin as in Example 1. 
Lysines 24 and 126 of these two peptide segments were initially protected with an 
Fmoc group at the s-amino group. After completion of the chain assembly, the 
5 Fmoc-bearing amino groups were deprotected following standard Fmoc deprotection 
procedures and modified by attachment of levulinic acid to each peptide resin, 
respectively, following standard coupling protocols. The peptides were then 
deprotected and simultaneously cleaved from the resin support according to standard 
Boc-chemistry procedures as in Example 1. The peptides were separately purified 
10 by preparative C4 reverse-phase HPLC. For each peptide, fractions containing pure 
peptide were identified using ES-MS, pooled and lyophilized for subsequent 
ligation. 

The water soluble polymer (EDA-Succ) 18 carboxylate (GRFNP32) was 
assembled on a Sasrin carboxy-generating resin following standard protocols (Rose, 

15 K. et al., U.S. Patent Application Serial No. 09/379,297; Rose, et al., J Am Chem 
Soc.(1999) 121: 7034). The water soluble polymer (EDA-Succ) 6 -amide (GRFNP6) 
was assembled on a Sieber amide-generating resin following standard protocols 
(Rose, K. et al, U.S. Patent Application Serial No. 09/379,297; Rose, et al., J Am 
Chem Soc.(\999) 121: 7034). An aminooxyacetyl (AoA) moiety was attached to the 

20 N-terminal amino group of each resin-bound polymer by coupling a fivefold excess 
of activated Boc-aminooxyacetic acid. The two polymer chains were separately 
cleaved from the respective resin supports using classic Fmoc-chemistry 
procedures. Each polymer chain was purified by preparative reverse phase HPLC. 
For each polymer, fractions containing pure polymer were identified using ES-MS, 

25 pooled and lyophilized for subsequent ligation. 

Segment SEP-1:4 and GRFNP6 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA. The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unmodified peptide and unreacted polymer by preparative gradient 
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C4 reverse-phase HPLC. Fractions containing the desired oximated product SEP- 
1:4 + GP6 were identified by ES-MS and pooled and lyophilized. 

Segment SEP-1 :1 and GRFNP32 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA. The solution was then 
5 lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unreacted polymer by preparative gradient C4 reverse-phase HPLC. 
Fractions containing the desired oximated product SEP-1 :1 + GP32 were identified 
by ES-MS and pooled and lyophilized. 

B. Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1- 

10 L26 

SEP-1-L26 was synthesized in solution from four polypeptide segments: 

Segment SEP:1:1+GP32 (GRFN 1711 + GRFNP32, corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK E AEK ox I T T G C A EH-thioester (where Lys 24 
15 is modified at the 8-amino group with a levulinic oxime linker group that is 

coupled to GRFNP32 through a levulinic-aminooxyacetyl (Lev-AoA) oxime 
bond denoted K ox ) 

Segment SEP-1 :2 (GRFN 1712, corresponding to residues 33-88 ofSEQ ID NO:2): 

CSLNEKIT VPDTKVN FY A WKRMEVGQQA VEVWQGLALL 
20 S EAVLRGQAL LVKSSQPW-thioester (where Cys 33 is Acm protected) 

Segment SEP-1 :3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys 89 is 
Acm protected) 

25 Segment SEP: 1 :4+GP6 (GRFN 1776 + GRFNP6, corresponding to residues 1 17-166 
of SEQ ID NO:2): 

CAIS PPDAAK ox AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
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CRTGDR-carboxylate (where Lys 126 is modified at the s-amino group with a 
levulinic oxime linker group that is coupled to GRFNP6 through a levulinic- 
aminooxyacetyl (Lev-AoA) oxime bond denoted K ox ' and where the C- 
terminal cysteine [i.e. Cys 161 ] carries a picolyl (pico) protecting group) 

Synthesis of additional peptides, ligation reactions, carboxymethylation, 
protecting group removal reactions, folding and purification were performed as 
described in Examples 1 and 2 to yield full-length, folded SEP-1-L26. An analytical 
C4 reverse-phase HPLC chromatogram and an ES-MS spectrum of the folded 
protein product as well as a CD spectrum demonstrated the presence of folded 
protein. 

Example 4 

Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP-1-B50 

A fourth synthetic erythropoiesis stimulating protein (designated SEP-1-B50) 
was synthesized. The amino acid sequence of the full-length SEP-1-B50 is the same 
asthatofSEP-l-L30: 

APPRLICDSR VLERYLLEAK E AEK 0X I T T G C A EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWvj/P LQLHVDKAVS GLRSLTTLLR ALGAQKvj/AIS 

P P D AAK ox A A P L RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR (SEQIDNO:2) 

where W denotes an non-native amino acid residue consisting of a cysteine that is 
carboxymethylated at the sulfhydryl group, and where K ox denotes a non-native 
lysine that is chemically modified at the e-amino group with an oxime linker group 
coupled to a designated water-soluble polymer through an oxime bond. 

However, the protein was derivatized with a branched polymer construct 
having four linear (Succ-EDA) 12 moieties rather than the linear (Succ-EDA) 18 
polymer of SEP-1-L30. Derivatization was accomplished via oxime linkages. 
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Figure 19 is a schematic illustrating the chemical synthesis of the branched 
water-soluble polymer GRFNP29 (GP29) that was joined to the SEP-1-B50 through 
oxime forming ligation as described in detail below. As shown in Figure 19, an 
orthogonally protected lysine U-B precursor was used to generate U-B component 
5 GRFN17 (GP17) bearing an aminooxy acyl as the U group, and having a 3X lysine 
branching core B bearing four pendant thiol groups for subsequent coupling through 
thioether linkages to a linear water-soluble polyamide ethylene oxide construct 
designated GP29, which bears a pendant carboxylate donated by a succinic acid 
residue. Assembly of the full-length product was as described in Example 2. The 
10 structure of SEP-1-B50 is shown in Figure 20. 

A. Synthesis of Template GRFNP17 Carrying Multiple Thiol 
Groups 

The template GRFNP17 was synthesized manually on an amide generating 
(4-methyl)benzhydrylamine(MBHA)-resin on a 0.4mmol scale. Fmoc-Lys(Boc)-OH 

15 was coupled using standard coupling protocols (Schnolzer, M., Int J Pept Protein 
Res. (1992) 40:180-93). 2.1mmol amino acid, 10% DIEA in 3.8 ml 0.5M HBTU 
was used; i.e. 5-fold excess of amino acid. After removal of the Fmoc protecting 
group, Fmoc-Lys(Fmoc)-0H was coupled using standard amino acid coupling 
protocols (2.1 mmol amino acid, 10% DIEA in 3.8 ml 0.5M HBTU; i.e. 5-fold 

20 excess amino acid). After a second Fmoc removal step, Fmoc-Lys(Fmoc)-0H was 
coupled using standard amino acid coupling protocols (4.2 mmol amino acid, 10% 
DIEA in 7.6 ml 0.5M HBTU; i.e. 5-fold excess amino acid relative to free amine). 
After a final Fmoc deprotection step, a five-fold excess (relative to free amines) of 
S-acetyl thioglycolic acid pentafluorophenyl ester (SAMA-oPfp) in DMF was 

25 coupled for 30 minutes. The Boc protecting group of the side chain of the C- 
terminal lysyl residue was removed by two times one minute batch washes with neat 
TFA, followed by neutralization of the resin by washing with 10% DIEA in DMF. 2 
mmol Boc-aminooxyacetic acid and 2 mmol N-hydroxysuccinimide (NHS) were 
dissolved in 3 ml DMF. After addition of 2 mmol DIC (diisopropylcarbodiimide), 

30 the acid was activated for 30-60 minutes. The solution was added to the neutralized 
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resin and coupled 1 hr. Finally, the S-linked acetyl groups were removed with 20% 
piperidine in DMF for 30 minutes. The template was deprotected and 
simultaneously cleaved from the resin support using HF/p-cresol according to 
standard Boc-chemistry procedures in the presence of cysteine as a scavenger for 
5 free aldehyde (SchnSlzer, M., Int J Pept Protein Res. (1992) 40:180-93). The 
recovered polyamide in 50% B [i.e. 50% aqueous acetonitrile containing 0.1%TFA] 
(aldehyde free) was frozen and lyophilized. For purification, the template crude 
product was dissolved in 2 ml 50% B, and 100 ml 100% A [i.e. 0.1% TFA in water] 
was added to dilute the sample (Avoid guanidinium chloride or acetate addition, 
10 since the addition of aldehyde is guaranteed). The template was loaded onto a C4 
preparative reverse-phase HPLC column equilibrated at T = 40°C at 3% B. Salts 
were eluted isocratically and the desired template, GRFNP17 purified in a linear 
gradient. Fractions containing the desired product were identified by ES-MS, pooled 
and lyophilized. 

1 5 B. Synthesis of Branched Water-Soluble Polymer GRNP29 

GRFNP29, a branched (EDA-Succ)i 2 polymer of 15kDa molecular weight 
was synthesized by thioether-generating ligation of purified thiol-containing 
template GRFNP17 and a linear polymer GRFNP31, Br-acetyl-(EDA-Succ) 12 
carboxylate , where GRFNP31 was synthesized on a Sasrin carboxy-generating resin 

20 following standard protocols (Rose, K. et al, U.S. Patent Application Serial No. 
09/379,297; Rose, etal, J Am ChemSoc.Q. 999) 121:7034). 

A 1.3x molar excess (over total thiols) of the purified GRFNP31, Br- 
acetylated (EDA-Succ) 12 , and purified thiol-containing template GRFNP17 were 
jointly dissolved 0.1 M Tris -HC1/ 6 M guanidinium chloride, pH 8.7 at -10 mM 

25 concentration. After dissolution, the solution was diluted threefold (v/v) with 0.1 M 
Tris -HC1, pH 8.7 buffer. The ligation mixture was stirred at room temperature and 
the reaction monitored by reversed-phase HPLC and ES/MS. Additional GRFNP31 
reactant was added on an as-needed basis until the desired reaction product was the 
major product. For workup, 3x (v/v to ligation mix) 0.1M acetate / 6 M 
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guanidinium chloride, pH 4 was added, and the solution was loaded onto a 
preparative C4 reverse-phase HPLC column, and purified with a linear gradient. 
Fractions containing pure GRFNP29 construct were identified using ES-MS, pooled 
and lyophilized. 

5 C. Oximation of GRFN1776 and GRFN171 1 with GRFNP29 

Segments SEP- 1:4, and Segment SEP-1:1 were synthesized as described in 
Example 2. Segment SEP-1:4 and GRFNP29 were jointly dissolved at an equimolar 
ratio in 50% aqueous acetonitrile containing 0.1% TFA . The solution was 
lyophilized. The dried powder was loaded onto a preparative reverse-phase HPLC 

10 column (1 inch diameter). The polymer-modified peptide was separated from 
unmodified peptide and unreactcd polymer by preparative gradient C4 reverse-phase 
HPLC. Fractions containing the desired oximated product SEP:1:4+GP29 were 
identified by ES-MS and pooled. 

Segment SEP-1:1 and GRFNP29 were jointly dissolved at an equimolar ratio 

15 in 50% aqueous acetonitrile containing 0.1% TFA. The solution was frozen and 
lyophilized. The dried powder was dissolved in 50% aqueous acetonitrile containing 
0.1% TFA and loaded onto a preparative GPC (gel permeation chromatography) 
column (1 inch diameter). The polymer-modified peptide was separated from 
unmodified peptide and unreacted polymer by isocratic elution. Fractions containing 

20 the desired oximated product SEP-1 : 1+GP29 were identified by ES-MS and pooled. 

D. Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP- 
1-B50 

SEP-1-B50 (SEQ ID NO:2) was synthesized in solution from four 
polypeptide segments: 

25 Segment SEP-1 :1+GP29 (GRFN 1711 + GRFNP29, corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK EAEK ox ITTGCA EH-thioester (where Lys 24 
is modified at the s-amino group with a levulinic oxime linker group that is 
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coupled to GRFNP29 through a levulinic-aminooxyacetyl (Lev-AoA) oxime 
bond denoted K ox ) 

Segment SEP-1:2 (GRFN 1712, corresponding to residues 33-88 of SEQ ID NO:2): 

CSLNEKIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 
SEAVLRGQAL LVKSSQPW-thioester (where Cys 33 is Acm protected) 

Segment SEP- 1:3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys 89 is 
Acm protected) 

Segment SEP11:4+GP29 (GRFN 1776 + GRFNP29, corresponding to residues 117- 

166 ofSEQIDNO:2): 

CAIS P P DA AK ox AAP L RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR-carboxylate (where Lys 126 is modified at the s-amino group with a 
levulinic oxime linker group that is coupled to GRFNP29 through a 
levulinic-aminooxyacetyl (Lev-AoA) oxime bond denoted K ox ' and where the 
C-terminal cysteine carries a picolyl (pico) protecting group) 

Synthesis of additional peptides, ligation reactions, carboxymethylation, 
protecting group removal reactions, folding and purification were performed as 
described in Examples 1 and 2, except that purification was on a Resource Q 
column, to yield full-length, folded SEP-1-B50 (SEQ ID NO: 2). An analytical C4 
reverse-phase HPLC chromatogram and an ES-MS spectrum of the folded protein 
product SEP-1-B50 as well as a CD spectrum demonstrated the presence of folded 
protein. 

Example 5 

Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP-3-L42 

A fifth synthetic erythropoiesis stimulating protein (designated SEP-3-L42) was 
synthesized. The amino acid sequence of the full-length SEP-3 protein is: 
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APPRLICDSR VLERYLLEAK EAECITTGCA EHCSLNECIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LACSSQPWEP LQLHVDKAVS GLRSLTTLLR ALGAQKEAIS 

PPDAACAAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
5 CRTGDR (SEQIDNO:3) 

The cysteine residues in positions: 24, 38, 83, and 126 were modified with 
maleimide-functionalized (EDA-Succ) 18 (GRFNP32) polymer units (via a Michael 
addition reaction) to form SEP-3-L42. The structure of SEP-3-L42 is shown in 
Figure 22. 

10 A schematic illustrating the chemical synthesis of SEP-3-L42 described 

below is shown in Figure 21. Briefly, Figures 21A and 21B show the native 
chemical ligation of the four SEP-3-L42 peptide segments SEP-3:1 through SEP-3:4, 
which generates cysteines at the ligation sites corresponding to all four native 
glycosylation sites in human EPO. In particular, Figure 21B shows full-length 

15 polypeptide having all disulfide-forming cysteines protected, permitting the site- 
specific attachment of four charged linear water soluble polymers (designated 
GRFN32-maleimide (GP32-maleimide)) at the cysteine ligation sites through a 
Michael addition reaction. Figure 21B also shows the final deprotection step of the 
disulfide-forming cysteines, and generation of the full-length, polymer-modified 

20 product. 

In detail, SEP-3 was synthesized in solution from four polypeptide segments: 

Segment SEP-3:1 (GRFN 1747, corresponding to residues 1-37 of SEQ ID NO:3): 
APPRLICDSR VLERYLLEAK EAECITTGCA EHCSLNE-thioester 
(where Cys 7 , Cys 29 , and Cys 33 are Acm protected) 

25 Segment SEP-3:2 (GRFN 1774, corresponding to residues 38-82 of SEQ ID NO:3): 

CIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 

SEAVLRGQAL LA-thioester (where Cys 38 is side chain protected with the 
Acm group) 
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Segment SEP-3:3 (GRPN 1749, corresponding to residues 83-125 of SEQ ID 
NO:3): 

CSSQPWBP LQLHVDKAVS GLRSLTTLLR ALGAQKEAIS 
PPDAA-thioester (where Cys 83 is side chain protected with the Acm group) 

Segment SEP-3:4 (GRFN 1750, corresponding to residues 126-166 of SEQ ID 
NO:3): 

CAAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR-carboxylate (where Cys 161 is Pbo [i.e., 4-(CH 3 S(0)-)benzyl- ] 
protected) 

Peptide synthesis. The peptides SEP-3:1 and SEP-3:2 and SEP-3:3 were 
synthesized on a thioester-generating resin by the in situ neutralization protocol for 
Boc chemistry SPPS, using established side-chain protection strategies as described 
in Example 1, with changes in protecting group strategy as noted in the specific 
peptides above. Segment SEP-3:4 was synthesized analogously on a -OCH 2 -Pam- 
resin. The peptides were deprotected and simultaneously cleaved from the resin 
support as described in Example 1. The resulting peptides described above were 
purified by preparative RP-HPLC. Fractions containing pure peptide were identified 
using ES-MS, pooled and lyophilized for subsequent ligation. 

Step 1: Ligation #1. Segment SEP-3:4 and segment SEP-3:3 were dissolved 
in TFE at 15 mM concentration. Saturated phosphate buffer (pH 7.5) containing 6 
M guanidinium chloride and 1% thiophenol was added, resulting in a clear solution 
of the peptide segments. After ligation, the ligation mix (defined as 1 volume) was 
added to 2 volumes of a solution of {2 ml TFE, 6 ml 6 M guanidinium chloride, 100 
mM phosphate containing 25% (3-mercaptoethanol} and incubated for 20 minutes. 
The solution was acidified with a solution of 15 mg/ml TCEP in glacial acetic acid, 
loaded onto a preparative C4 reverse-phase HPLC column and purified with a linear 
gradient. Fractions containing the desired ligated product SEP-3:Acm+SEP- 
3:3+SEP-3:4 were identified by ES-MS and pooled. 

Step 2: Acm-removal #1 For Acm removal, the aqueous acetonitrile solution 
containing the pooled fractions of SEP-3:Acm+SEP-3:3+SEP-3:4 was diluted lx 
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with HPLC grade water, and solid urea was added for a final concentration of 2 
molar. A threefold molar excess (relative to the total expected cysteine 
concentration) of a 30 mg/ml Hg(acetate) 2 solution in 3% aqueous acetic acid was 
added and the solution was stirred for one hour. The solution was then made 20% in 
5 p-mercaptoethanol, loaded onto a semi-preparative C4 reverse-phase HPLC column 
and purified with a step gradient. Fractions containing the desired ligated product 
SEP-3:3+SEP-3:4 were identified by ES-MS and lyophilized overnight. 

Step 3: Ligation #2 Equal amounts of SEP-3:3+SEP-3:4 and SEP-3. 2 were 
jointly dissolved in neat TFE trifluoroethanol at 15 mM concentration. 250 mM 

10 Phosphate buffer (pH 7.5) containing 6 M guanidinium and 1% thiophenol was 
added, resulting in a clear solution of the peptide segments. After one day of 
ligation, the ligation mix (defined as 1 volume) was added to 2 volumes of a solution 
of 10 ml TFE, 10 ml p-mercaptoethanol, 10 ml piperidine and 20 ml 6M 
guanidinium, pH 4, and incubated for 20 minutes to remove any remaining 

15 protecting groups. The solution was acidified with a solution of 15 mg/ml TCEP in 
20% aqueous acetic acid, loaded onto a preparative C4 reverse-phase HPLC column 
and purified with a linear gradient. Fractions containing the desired ligated product 
SEP-3 :Acm+SEP-3:2+SEP-3:3+SEP-3:4 were identified by ES-MS and lyophilized 
overnight. 

20 Step 4 Acm removal For Acm removal, the aqueous acetonitrile solution 

containing the pooled fractions of SEP-3:Acm+SEP-3:2+SEP-3:3+SEP-3:4 was 
diluted lx with HPLC grade water, and solid urea added for a final concentration of 
2 molar. A threefold molar excess (relative to the total expected cysteine 
concentration) of a 30 mg/ml Hg(acetate) 2 solution in 3% aqueous acetic acid was 

25 added and the solution stirred for one hour. The solution was then made 20% in p- 
mercaptoethanol, loaded onto a C4 semi-preparative reverse-phase HPLC column 
and purified with a step gradient. Fractions containing the desired ligated product 
SEP-3 :2+SEP-3 :3 +SEP-3.4 were identified by ES-MS and lyophilized overnight. 

Step 5: Ligation #3 Equal amounts of SEP-3 :2+SEP-3 : 3+SEP-3 :4 and 

30 SEP:3:1 were jointly dissolved in neat TFE at 15 mM concentration. 250 mM 
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Phosphate buffer (pH 7.5) containing 6 M guanidinium and 1% thiophenol was 
added, resulting in a clear solution of the peptide segments. After one day of 
ligation, the ligation mix (defined as 1 volume) was added to 2 volumes of a solution 
of 10 ml TFE, 10 ml p-mercaptoethanol, 10 ml piperidine and 20 ml 6M 
5 guanidinium, pH 4, and incubated for 20 minutes to remove any remaining 
protecting groups. The solution was acidified with a solution of 15 mg/ml TCEP in 
20% aqueous acetic acid, loaded onto a preparative C4 reverse-phase HPLC column 
and purified with a linear gradient. Fractions containing the desired ligated product 
SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4 were identified by ES-MS and lyophilized 
10 overnight. 

Step 6: Attachment of the Polymer GRFNP32. A maleimide-functionalized 
linear (EDA-Succ)i 8 polymer called GRFNP32-maleimide was prepared by 
functionalizing GRFNP32 with BMPS (3-maleimido propionic acid NHS ester, 
Pierce, USA) following the manufacturers protocols to form a maleimide- 

15 functionalized (EDA-Succ) [8 polymer [i.e., maleimide-(EDA-Succ) ]8 ]. SEP- 
3:l+SEP-3:2+SEP-3:3+SEP-3:4 was dissolved in the minimum amount of TFE 
required. A threefold excess of GRFNP32-maleimide was dissolved in 6M 
guanidinium chloride, 100 mM phosphate, pH 7.5 and added to the TFE solution. 
The progress of the Michael addition reaction was followed by analytical reverse- 

20 phase HPLC. After the reaction was complete, the solution was loaded onto a 
preparative C4 reverse-phase HPLC column and purified with a linear gradient. 
Fractions containing the desired polymer-modified product SEP3 :Acm+SEP- 
3:l+SEP-3:2+SEP-3:3+SEP-3:4+pPEG [i.e. the ligated full-length 166 residue 
polypeptide chain with four copies of GRFNP32 attached to the side chain thiols of 

25 Cys 24 , Cys 3S , Cys 83 and Cys 126 , and thus also called SEP3:Acm+SEP-3:l+SEP- 
3:2+SEP-3:3+SEP-3:4+GP32], were identified by ES-MS and lyophilized overnight. 

Step 7: Pbo removal For Pbo reduction, the lyophilized powder of 
SEP3:Acm+SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4+GP32 was dissolved in neat TFA 
containing 5% ethanedithiol. The Pbo group was then cleaved by addition of 10% 

30 thioanisole and 15% bromotrimethylsilane for 30 minutes. The solution was dried in 
a rotator-evaporator and taken up in aqueous acetonitrile containing 0.1% TFA. The 
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resulting solution was loaded onto a semi-preparative reverse-phase HPLC column 
and purified with a step gradient. Fractions containing the desired Cys 161 -deprotected 
product SEP3:Acm+SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4+GP32-Pbo were 
identified by ES-MS and lyophilized overnight. 
5 Step 8: Acm removal For final Acm removal from the side chains of Cys 7 , 

Cys 29 , and Cys 33 , the aqueous acetonitrile solution containing the pooled fractions of 
SEP3:Acm+SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4+GP32-Pbo was diluted lx with 
HPLC grade water, and solid urea was added for a final concentration of 2 molar. A 
threefold molar excess (relative to the total expected cysteine concentration) of a 30 

10 mg/ml Hg(acetate) 2 solution in 3% aqueous acetic acid was added and the solution 
was stirred for one hour. The solution was then made 20% in p-mercaptoethanol, 
loaded onto a semi-preparative C4 reverse-phase HPLC column and purified with a 
step gradient. Fractions containing the desired ligated, polymer-modified product 
SEP-3 (1-166) were identified by ES-MS and lyophilized overnight. 

15 Step 9 Folding: Full-length ligated, polymer-modified peptide SEP-3 (1- 

166) was dissolved in 200 mM Tris buffer (pH 8.7) containing 6 M guanidinium 
chloride and 20% TFE and a ten-fold molar excess (relative to Cys residues in SEP- 
3) of cysteine. This solution was dialyzed overnight against a solution of 200 mM 
Tris buffer (pH 8.7) containing 3 M guanidinium chloride at room temperature. The 

20 solution was then dialyzed against a solution of 200 mM Tris buffer (pH 8.7) 
containing 1 M guanidinium chloride for 4 hours at 4 °C and finally against 10 mM 
phosphate buffer (pH 7.0) for 4 hours at 4°C to yield the final folded product. 
Folding was verified by electrospray ES-MS and CD spectrometry. 

Step 10 Purification: The folded polypeptide was concentrated 5x in 

25 centricon concentrator vials and loaded on to Resource S cation exchange column 
equilibrated at 10 mM phosphate, pH 7.0. The folded protein was eluted in a linear 
salt gradient to 500 mM NaCl in 10 minutes. Fractions containing the desired folded 
product SEP-3- L42 were identified by SDS-PAGE , and frozen and stored at -80 °C. 
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Example 6 

Bioactivity Assay of Synthetic Erythropoiesis Stimulating Proteins 

The bioactivity of folded synthetic erythropoiesis stimulating proteins, SEP- 
0, SEP-1-L26, SEP-1-L30, SEP-1-B50, and SEP-3-L42 was determined using UT/7 
5 and 32D 103197 cell lines, in factor-dependent cell-line proliferation assays using 
commercial recombinant erythropoietin as a control standard. UT-7 is a human 
megakaryoblastic leukemia cell line with absolute dependence on one of interleukin- 
3, granulocyte-macrophage colony-stimulating factor (GM-CSF), or erythropoietin 
(EPO) for growth and survival (Miura Y, et al., Acta Haematol (1998) 99:180-184); 

10 Komatsu N, et al., Cancer Res. (1991) 51:341-8). 32D 103197 is a murine 
hemopoietic cell line (Metcalf, D. Int J Cell Cloning (1992) 10:116-25). 

Stock-solutions of the SEP constructs were made in Iscove's modified 
Dulbecco's medium (IMDM), 10% FBS (Fetal bovine serum), glutamine and 
Penstrep, and serial 2x dilutions of these stock solutions were added to multi-well 

15 plates to which human UT/7 EPO cells at a concentration of 5000 cells/50 pi were 
added. The plates were incubated at 37 °C in the presence of 5% C0 2 and monitored 
daily for growth. After four days, 20 pi 2.5 mg/ml MTT (methylthiazol tetrazolium) 
in PBS (phosphate buffered saline) was added and the plates were incubated for four 
hours. 150 pi IPA was added and the absorbance of each well was read at 562 nm. 

20 The ED50 (effective dose to reach 50% of maximum effect) values for the SEP 
compounds was determined and compared to that of CHO (Chinese hamster ovary) - 
cell produced rhEPO (recombinant human erythropoietin). The results from these 
experiments demonstrated that all of the synthetic erythropoiesis stimulating proteins 
exhibited bioactivity. ED50 results for SEP-0, SEP-1-L26, SEP-1-L30, and SEP-1- 

25 B50 are shown in Table VI. 
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Table VI 



Erythropoiesis 
Stimulating Protein 


In Vitro ED50 Values (pM) 


UT-7 (Human) 
Cells 


32D 103197 
(Mouse) Cells 


SEP-0 


1,570 


863 


SEP-1-L26 


46.5 


100.8 


SEP-1-L30 


71.5 


182.5 


SEP-1-B50 


182 


6200 


rhEPO 


32.5 


136.3 



Additional results from these experiments when normalized for peptide 
content only using an extinction coefficient for calculating protein concentration by 
absorbance for polymer-modified SEP-1-L30 are shown in Table VII and Figure 28 
5 for SEP-0, SEP-1-L26, SEP-1-L30, SEP-1-B50, SEP-3-L42, and SEP-1-B51 
(synthesis of the SEP-1-B51 compound is described in Example 7 below). 
Collectively, these in vitro assays demonstrate the differential effect of varying the 
polymer structures and sites of attachment on in vitro bioactivity, where these 
compounds have the following relative order of potency when tested for (1) human 
10 EPO receptor activity (from most potent to least potent): SEP-1-L26 « SEP-1-L30 > 
SEP-1-B50 » SEP-1-B51 > SEP-3-L42 > SEP-0; and (2) mouse EPO receptor 
activity (from most potent to least potent): SEP-1-L26 > SEP-1-L30 > SEP-0 > SEP- 
3-L42 > SEP-1-B50 m SEP-1-B51. 

Table VII 





In Vitro ED50 Values (nz/ml) 


Erythropoiesis 


UT-7 Cells 


32D 103197 Cells 


Stimulating Protein 


(Human EPO R) 


(Mouse EPO R) 


SEP-0 


11 


20 


SEP-1-L26 


0.8 


1 


SEP-1-L30 


0.8 


4 


SEP-3-L42 


7 


70 


SEP-1-B50 


2.5 


125 


SEP-1-B51 


2 


125 



WO 02/20033 



PCT7US01/21930 



The results in Table VII and Figure 28 also show a significant difference in 
receptor preference for the polymer-modified SEP constructs compared to the non- 
polymer modified construct SEP-0. In particular, SEP-1-L26 is modified with 
different linear polymers attached at corresponding human EPO glycosylation site 
5 position 24 (a 5.5 kDa linear polymer with a pendant negative charge attached at this 
site) and position 126 (a 1.8 kDa linear polymer with a pendant neutral charge 
attached at this site); this construct had a similar activity against both human and 
mouse receptors. SEP-1-L30 is modified with a 5.5 kDa linear polymer having a 
pendant negative charge at corresponding human EPO glycosylation site positions 

10 24 and 126; this construct was about 5X more active against the human receptor 
compared to the mouse receptor. SEP-3-L42, which is modified with a 5.5 kDa 
linear polymer having a pendant negative charge at' corresponding human EPO 
glycosylation site positions 24, 38, 83 and 126 was about 10X more active against 
the human receptor compared to the mouse receptor. The largest differential was 

15 observed for SEP-1-B50 and SEP-1-B51, which are modified with branched, 
negatively charged 15 kDa polymers attached at corresponding human EPO 
glycosylation site positions 24 and 126, and have a pi of approximately 5 (similar to 
human EPO); these two constructs were ~ 60X more active against the human 
receptor compared to the mouse receptor. 

20 As mouse EPO is missing a glycosylation site at a position corresponding to 

human EPO position 126 (mouse EPO has non-glycosylated proline instead of the 

0- glycosylated serine found in human EPO), the altered receptor activity may be due 
in part to this difference, and thus preference of the mouse receptor for an EPO that 
is missing O-linked glycosylation at corresponding position 126. This is in line with 

25 the comparison of SEP-1-L26 and SEP-1-L30 to SEP-1-B50 and SEP-1-B52. For 
example, SEP-1-L26 has a 1.8 kDa uncharged polymer at position 126 whereas SEP- 

1- L30 has a 5.5 kDa negatively charged polymer at position 126. Although 
equivalent activity was observed against the human EPO receptor, a 4-fold 
difference was found against the mouse EPO receptor. The largest difference was 

30 observed for the SEP-1-B50 and SEP-1-B51 constructs, with negatively charged 15 
kDa polymers attached at corresponding human EPO glycosylation site position 126. 
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Addition of the 5.5 lcDa negatively charged polymers at sites corresponding to all 
four native glycosylation positions in human EPO, i.e., 24, 38, 83 and 126 as with 
the SEP-3-L42 construct reduced preference for mouse even further, but less than 
the SEP-1-B50 and SEP-1-B51 constructs. These results demonstrate that receptor- 
specificity can be modulated by precision modification of sites corresponding to 
glycosylation sites in a biologically produced, glycosylated protein, and by adjusting 
the size, the nature of branching and charge of the polymer. 

Example 7 

Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP-1-B51 

A. Composition of SEP-1-B51. A sixth synthetic erythropoiesis 
stimulating protein (designated SEP-1-B51) was synthesized. The amino acid 
sequence of the full-length SEP-1-B51 is the same as that of SEP-1-B50: 

APPRLICDSR VLERYLLEAK EAEK ox ITTGCA EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWyP LQLHVDKAVS GLRSLTTLLR ALGAQKvj/AIS 

PPDAAK^AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 

CRTGDR (SEQ ID NO: 2) 

where W denotes an non-native amino acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group, and where K ox denotes a non- 
native lysine that is chemically modified at the g-amino group with an oxime linker 
group coupled to a designated water-soluble polymer through an oxime bond. 

However, in contrast to SEP-1-B50, the protein was derivatized with a 
branched polymer construct having four linear (Succ-TTD)i 2 -Succ-Alanine-OH [i.e., 
(Succ-TTD)i2-Succ-Ala-OH, or (Succ-TTD)i 2 Succ-Ala] polymers coupled through 
amide bonds to the lysine branching core. Coupling of the linear polymers to the 
branching core through amide bonds was designed to improve stability. The four 
linear polymers also are designed to carry an alanine moiety and pendant negative 
charges, with the negative charges mimicking sialic acids and the alanine a 
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hydrophobic character similar to pendant sugar moieties of carbohydrate chains. The 
branched polymers also were designed to have a water-soluble spacer between the 
branching core and the protein backbone attachment site to improve synthesis and 
handling properties of the branching core template, and to mimic the spacing found 
5 in natural carbohydrate chains between the site of sugar attachment to the protein 
backbone and the first branch point of the carbohydrate. 

Figure 23 is a schematic illustrating the chemical synthesis of the branched water- 
soluble polymer GRFNP41 (GP41) that was joined to the SEP-1-B51 through 
oxime-forming ligation as described below. Assembly of the full-length product was 
1 0 as described in Example 2. The structure of SEP- 1 -B5 1 is shown in Figure 24. 

B. Synthesis of (Succ-TTD)i 2 Succ-AlaOtBu (GRFNP39) for 
coupling to the branching template (GRFNP40). 

Linear polymer (Succ-TTD)i 2 Succ-AlaOtBu (GRFNP39) was synthesized on 
a 0.5 mmol scale on Sasrin acid labile, carboxylic acid-generating resin. 0.5mmole 

15 (-0.5 grams) Sasrin acid labile, carboxylic acid-generating polystyrene resin 
(hydroxyl substitution 1.02 mmole/g;) was swelled in DMF for 15 minutes, and then 
drained. To this hydroxyl-functionalized resin was added 450 mg (4.5 mmole) 
succinic anhydride and 488 mg (4 mmole) 4-(dimethylamino)pyridine dissolved in 8 
ml of DMF containing 500 microliter (3.9 mmole) DIEA (diisopropylethylamine) 

20 and allowed to react for 30 minutes with vortex agitation, and drained. The coupling 
was repeated and excess reactants and soluble coproducts were removed by a 1 
minute vortexing flow wash with DMF (~50ml) and then drained. The HOOC- 
CH 2 CH 2 CO-0-resin (0.5 mmole) was activated by addition of 8 ml of fresh 1.0 M (8 
mmole) CDI (carbonyldiimidazole) solution in DMF and allowed to react for 40 

25 minutes, and then drained. Excess reactants and soluble coproducts were removed 
by a 1 minute vortexing flow wash with DMF (~50ml), then drained. 4 ml (4 grams, 
18.2 mmole) (4,7,10)-trioxatridecane-l,13diamine (TTD) dissolved in 4 ml 0.5M (2 
mmole) HOBT solution in DMF was added and allowed to react with vortex 
agitation for 30 minutes, then drained. Excess reactants and soluble coproducts were 

30 removed by a 1 minute vortexing flow wash with DMF (~50ml), then drained. 
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Succinic anhydride (450 mg , 4.5 mmole) dissolved in 8 ml of 0.5M (4 mmole) 
HOBT (N-hydroxybenzotriazole) solution containing 500 microliter (3.9 mmole) 
DIEA was added to the resin and allowed to react with vortex agitation for 15 
minutes, then drained. The three steps (CDI activation; TTD coupling; succinic 
5 anhydride reaction) were repeated eleven times [i.e. a total of twelve times]. Excess 
reactants and soluble coproducts were removed by a 1 minute vortexing flow wash 
with DMF (~50ml), and drained. The HOOC-CH 2 CH 2 CO(TTD-succinyl)i 2 -0-resin 
(0.5 mmole) was activated with 8ml of fresh 1.0 M (8 mmole) CDI solution in DMF 
and allowed to react for 40 minutes, and then drained. Excess reactants and soluble 

10 coproducts were removed by a 1-minute vortexing flow wash with DMF (~50ml), 
then drained. 2.5 mmole H-AlaOfBu.HCl was dissolved in 4.75 ml 0.5 M (2.375 
mmole) HOBT in DMF containing 150 microliter (111 mg, 0.825 mmole) DIEA, 
and allowed to react with the CDI-activated HOOC-CH 2 CH 2 CO(TTD-succinyl)i 2 -0- 
resin (0.5 mmole) for 1 hour with vortex agitation, then drained. Excess reactants 

15 and soluble coproducts were removed by a 1-minute vortexing flow wash with DMF 
(~50ml), and then drained. The product tertBuOOC-CH(CH3)-NH-OC- 
CH 2 CH 2 CO(TTD-succinyl)i 2 -0-resin was washed extensively with DCM 
(dichlormethane), drained and then the resin was dried under vacuum to constant 
weight. Typical weight of product-resin was around 2 grams. 

20 The linear (Succ-TTD) ]2 Succ-AlaOtBu polymer was cleaved from the resin 

support according to standard Fmoc-chemistry procedures using 4% TFA in DCM. 
The precipitated crude product was dissolved in 50% aqueous acetonitrile containing 
0.1% TFA and lyophilized. The lyophilized polymer was dissolved in a small 
amount of 50% aqueous acetonitrile containing 0.1% TFA and diluted to reduce the 

25 concentration of organic below 1%. The crude product was loaded onto a C4 
preparative reverse-phase column equilibrated at T = 40°C at 3% B. Salts were 
eluted isocratically and the desired template was purified with a linear gradient of 
20-35 % Buffer B (acetonitrile containing 0.1% TFA) versus 0.1% aqueous TFA 
over 60 minutes. Fractions containing the desired (Succ-TTD) 12 -Succ-AlaOtBu 

30 material (GRFNP39) were identified by ES-MS, frozen and lyophilized. 
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C. Synthesis of a template carrying multiple amine groups for 
branching and a protected aminooxy group for (later) attachment 
to the protein (GRFNP40) 

The template GRFNP40 was synthesized manually on a Sieber amide- 
5 generating resin on a 0.4 mmol scale. There was a one-minute flow-washing step 
with DMF between every coupling, deprotection and activation step. 2 mmol N a - 
Fmoc-N p (Boc-aminooxyacetyl)-L-diaminopropionic acid was coupled to the resin, 
using 30 minute NHS-ester pre-activation with 2 mmol DIC and 2 mmol NHS in 
DCM /DMF (dimethylformamide), for 1 hour. After removal of the Fmoc protecting 

10 group (2x3 minutes 20%v/v piperidine in DMF) and washing, 4 mmol succinic 
anhydride dissolved in 8ml of 0.5M HOBT (N-hydroxybenzotriazole) solution 
containing 2.2 mmol DIEA was coupled to the resin for 10 minutes. After this step, 
the carboxyl group was activated with 8 ml of fresh 0.5 M (4 mmole) CDI solution 
in DMF. 4ml (18.2 mmole) TTD was added in 4 ml 0.5M HOBT solution in DMF 

1 5 and coupled for 30 minutes. 

Fmoc-Lys(Fmoc)-0H (2 mmol ) was coupled to the resulting amino-TTD- 
Succ-Dpr(BocAoA)-resin (where Dpr = diaminopropionic acid) using 30 minutes 
pre-activation with 2 mmol DIC and 2 mmol NHS in DMF and coupling for 1 hour. 
This coupling was repeated once. After an Fmoc removal step (2x3 minutes 

20 20%v/v piperidine in DMF), 4 mmol Fmoc-Lys(Fmoc)-0H was coupled as described 
above, including recoupling. After a final Fmoc protection step (2x3 minutes 
20%v/v piperidine in DMF), the template was cleaved from the resin support 
according to standard Fmoc-chemistry procedures using 4% TFA in DCM. The 
precipitated product was dissolved in 50% aqueous acetonitrile containing 0.1% 

25 TFA and lyophilized. The lyophilized crude product was dissolved in a small 
amount of 50% aqueous acetonitrile containing 0.1% TFA and diluted to reduce the 
concentration of organic below 1%. The template was loaded onto a C4 preparative 
reverse-phase HPLC column equilibrated at T = 40°C at 3% Buffer B (0.1%TFA in 
acetonitrile). Salts and other non-amide material were eluted isocratically, and the 

30 desired template was purified with a linear gradient of 5-12 % Buffer B (acetonitrile 
containing 0.1% TFA) versus 0.1% aqueous TFA over 60 minutes. Fractions 
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containing the desired Lys-Lys(Lys)-TTD-Succ-Dpr(BocAOA).amide material 
(GRFNP40) were identified by ES-MS, frozen and lyophilized. 

D. Assembly and deprotection of the amide-coupled branched 
polymer (GRFNP41) 

5 GRFNP41, a branched (TTD-Succ) 49 - polymer of 16kDa molecular weight 

was synthesized by coupling of GRFNP39 [(Succ-TTD) 12 -Succ-AlaOtBu] to the 
purified template GRFNP40. Purified (Succ-TTD) 12 -Succ-AlaOtBu (GRFNP39) 
(l.Ommole) dissolved in DMSO (dimethylsulfoxide) at 60°C to a concentration of 
20 mg/ml was activated with 0.95mole of HATU {<9-(7-azabenzotriazol-l-yl)- 

10 1,1,3,3-tetramethyluronium hexafluorophosphate} in DMSO at a concentation of 10 
mg/ml in the presence of a twenty- fold (molar) excess of DIBA. Purified template 
GRFNP40 (0.24mole) dissolved in DMSO at a concentration of 3.9 mg/ml was 
added immediately. Progress of the reaction was monitored by C4 analytical 
reverse-phase HPLC and ES-MS. Typically, the coupling was complete within 

15 minutes. For work-up, 4 volumes (relative to ligation mix) 0.1M sodium acetate / 6 
M guanidinium chloride, pH 4 was added, and the solution was loaded onto a 
preparative (C4) reverse-phase HPLC column. Salts and other non-amide containing 
material were eluted isocratically and the desired branched polymer product was 
purified with a linear gradient of 20-35 % Buffer B (acetonitrile containing 0.1% 

20 TFA) versus 0.1% aqueous TFA in 80 minutes. Fractions containing the desired 
material were identified by ES-MS , frozen and lyophilized. 

The resulting purified polymer was dissolved in neat TFA at a concentration 
of 1 mg/ml for 1 hour to remove the Boc group from the aminooxyacetyl moiety and 
to remove the tertButyl ester groups from the -Ala-OtBu moieties. The solution 

25 was rotated to dryness in a rotator evaporator and the dried polymer was dissolved in 
50% Buffer B (acetonitrile containing 0.1% TFA). Aminooxyacetic acid (a.k.a. 
'Carboxymethoxylamine') was added to a final concentration of 0.5M to scavenge 
adduct-forming impurities. After 20 minutes, the solution was diluted out with 3.3 
volumes of Buffer A (0.1%TFA in water), and loaded onto a preparative C4 reverse 

30 phase HPLC column and pumped at 10% Buffer B until all the aminooxyacetic acid 



PCT7US01/21930 



was removed, then the polymer was purified with a step gradient from 15% to 45% 
Buffer B versus 0.1% aqueous TFA over 80 minutes. Pooled fractions containing 
the desired branched polymer material (GRFNP41) were frozen and lyophilized. 

E. Oxime-forming ligation (Oximation) of the peptide segments 
5 GRFN1776 and GRFN1711 with the branched polymer 

GRFNP41 

Segments SEP- 1:4 (GRFN1776, composed of residues 117-166 of SEQ ID 
NO:2 and Segment SEP-1:1 (GRFN1711, composed of residues 1-32 of SEQ ID 
NO:2 were synthesized as described above in Example 1 using standard in situ 

10 neutralization Boc chemistry SPPS. Peptide GRFN1776 was synthesized on an - 
OCH2-Pam-Resin following standard protocols also as described above. Peptide 
GRFN1711 was synthesized on a thioester-generating resin following standard 
protocols as described above. Segment GRFN1776 containing a Ievulinic acyl 
moiety on Lys 120 and the AoA-containing GRFNP41 were jointly dissolved at an 

15 equimolar ratio in 50% aqueous acetonitrile containing 0.1% TFA. The solution was 
lyophilized. The dried powder crude product was redissolved in 50% aqueous 
acetonitrile containing 0.1% TFA and loaded onto a preparative reverse-phase (C4) 
HPLC column. The polymer-modified peptide was separated from unmodified 
peptide and unreacted polymer by preparative gradient reverse phase HPLC. 

20 Fractions containing the desired oxime-linked (oximated) product SEP-1:4+GP41 
were identified by ES-MS and pooled and lyophilized. 

Segment GRFN1711 containing a Ievulinic acyl moiety on Lys 24 and the 
AoA-containing GRFNP41 were jointly dissolved at an equimolar ratio in 50% 
aqueous acetonitrile containing 0.1% TFA. The solution was frozen and lyophilized. 

25 The dried powder was dissolved in 50% aqueous acetonitrile containing 0.1% TFA 
and loaded onto a C4 preparative reverse-phase HPLC column. The polymer- 
modified peptide was separated from unmodified peptide and unreacted polymer by 
preparative gradient elution C4 reverse-phase HPLC. Fractions containing the 
desired oxime-linked (oximated) product SEP-1:1+GP41 were identified by ES-MS, 

30 and pooled and lyophilized. 
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F. Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP- 
1-B51 

SEP-1-B51, having (SEQ ID NO:2), was synthesized in solution from four 
polypeptide segments: 

5 Segment SEP-1:1+GP41 (GRFN 1711+GRFNP41; corresponding to residues 1-32 
of SEQ ID NO:2): 

APPRLICDSR VLERYLLEAK EAEK° X ITTGCA EH-thioester (where 
Lys 24 has a levulinic acyl pendant moiety oxime-linked to the branched 
polymer GRFNP41, as denoted by K""; and where His 32 is Dnp protected) 

10 Segment SEP-1:2 (GRFN 1712; corresponding to residues 33-88 of SEQ ID NO:2): 
CSLNEKIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 
SEAVLRGQAL LVKSSQPW-thioester (where Cys 33 is Acm protected; and 
where the three Trp residues are formyl protected) 

Segment SEP-1:3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
15 NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys 89 is 
Acm protected; and where His 94 is Dnp protected) 

Segment SEP-1:4+GP41 (GRFN 1776+GRFNP41, corresponding to residues 117- 
166 of SEQ ID NO:2): 
20 CAIS PPDAAK ox AAPL RTITADTFRK LFRVYSNFLR 

GKLKLYTGEA CRTGDR-carboxylate (where the C-terminal cysteine (i.e. 

Cys 161 ) carries a picolyl (pico) protecting group, and where Lys 126 has a 

levulinic acyl pendant moiety oxime-linked to the branched polymer 

GRFNP41, as denoted by K ox ) 

25 Synthesis of additional peptides, ligation reactions, carboxymefhylation, 

protecting group removal reactions, folding and purification to yield full-length, 
folded SEP-1-B51 (SEQ ID NO:2) were performed as described in Examples 1-4, 
with the following modifications: 
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Step 1: Ligation #1 Segment SEP-L4+GP41 was dissolved in TFE (1 
volume) at 3 mM concentration. Segment SEP-1:3 was dissolved in 2 volumes of 
300mM Na phosphate buffer (pH 7.9) containing 6 M guanidinium chloride to a 
concentration of 2.25 mM. The two solutions were mixed, giving final segment 
5 concentrations of ImM SEP-1.-4+GP41 and 1.5mM SEP-1.3, and 1% thiophenol 
was added, resulting in a solution ('3 volumes') of the peptide segments at a pH of 
6.8-7.2. Reaction was allowed to proceed overnight at room temperature. After 
ligation, p-mercaptoethanol (3 volumes) was added to the ligation mix, followed by 
3 volumes of 6 M guanidinium chloride 300 mM Na phosphate pH7.9 buffer, and 

10 TCEP was added (0.25 by weight of the total weight of peptide segments), and the 
solution stirred for 20 minutes. The solution was acidified to pH4.0+/-0.1 with 0.6 
volumes of glacial acetic acid to give a clear solution to which was added 30 
volumes of pH4.0, lOOmM Na acetate dilution buffer, 6M in guanidinium chloride. 
The resulting solution was pumped onto a preparative reverse-phase (C4) HPLC 

15 column. Buffer was pumped at 5% B [thus the remainder is 95% Buffer A 
(0.1%TFA in water)] until all non-peptide materials had eluted from the column, 
then the ligation product was purified by a gradient of 25-45% Buffer B over 80 
minutes. Fractions containing the desired ligated product (Cys 89 (Acm)){SEP- 
1 :3+SEP-l :4+GP41 } were identified by electrospray mass spectrometry and pooled. 

20 Step 2: Acm-removal #1 For Acm removal, the aqueous acetonitrile 

solution containing the pooled fractions of (Cys 89 (Acm)){SEP-l:3+SEP-l:4+GP41} 
was diluted lx with HPLC grade water, and solid urea was added for a final 
concentration of 2 M. A threefold molar excess (relative to the total cysteine 
concentration) of a 30 mg/ml Hg(acetate) 2 solution in 3% aqueous acetic acid was 

25 added and the solution was stirred for one hour. The solution was then made 20% in 
(3-mercaptoethanoI, loaded onto a semi-preparative reverse-phase HPLC column and 
pumped at 25% Buffer B until all non-peptide material had eluted, and the product 
was then purified with a step gradient to 50% Buffer B. Fractions containing the 
desired ligated product {SEP-1:3+SEP-1:4+GP41} were identified by electrospray 

30 mass spectrometry, pooled and lyophilized. 
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Step 3: Ligation #2 The {SEP-1:3+SEP-1:4+GP41} product [i.e. 1713- 
1776-GP41] from Step 2 was dissolved in TFE (1 volume) at 3 mM concentration. 
Segment SEP-1:2 (segment GRFN1712) was dissolved in 2 volumes of 300mM Na 
phosphate buffer (pH 7.9) containing 6 M guanidinium chloride to a concentration of 
5 2.25 mM. The two solutions were mixed, giving final segment concentrations of 
ImM {SEP-1:3+SEP-1:4+GP41} and 1.5mM SEP-1:2, and 1% thiophenol was 
added, resulting in a solution ('3 volumes') of the peptide segments at a pH of 6.8- 
7.2. Reaction was allowed to proceed overnight at room temperature. Then the 
ligation mix was diluted with 3 volumes (i.e. relative to the volume of TFE used 

10 above) of dilution buffer: lOOmM sodium acetate pH4.0 containing 6M guanidinium 
chloride, and was then added to a solution, cooled to 4°C, consisting of 3 volumes 
TFE, 3 volumes p-mercaptoethanol, 3 volumes piperidine and 6 volumes of 6M 
guanidinium chloride, lOOmM Na acetate pH 4.0 and was stirred for 20 minutes at 
room temperature to remove any remaining protecting groups. The solution was 

15 acidified with 1.8 volumes of frozen glacial acetic acid to give a clear solution to 
which was added TCEP (0.25 by weight of the total weight of peptide segments), 
and the solution was incubated for 20 minutes. Then 30 volumes of pH4.0, lOOmM 
Na acetate dilution buffer, 6m in guanidinium chloride was added and the solution 
was mixed.. The resulting solution was pumped onto a preparative reverse-phase 

20 (C4) HPLC column. Buffer was pumped at 30% C (Buffer C: 0.1% TFA in 60% 
isopropanol/30% acetonitrile/10%water) until all non-peptide materials had eluted 
from the column, then the ligation product was purified by a gradient of 37-57%% 
Buffer C over 80 minutes. Fractions containing the desired ligated product 
(Cys 33 (Acm)){SEP-l:2+SEP-l:3+SEP-l:4+GP41) were identified by ES-MS and 

25 pooled and lyophilized. 

Step 4: Carboxymethylation of residues Cys 89 and Cys" 7 The 
(Cys 33 (Acm)){SEP-l:2+SEP-l:3+SEP-l:4+GP41} [i.e. (Cys 33 (Acm))-1712-1713- 
1776-GP41] was dissolved in TFE at 1 mM concentration. 10 volumes of 300 mM 
Na phosphate buffer (pH 7.9) containing 6 M guanidinium chloride was added. A 

30 25-fold excess (over sulfhydryl groups) of bromoacetic acid dissolved in methanol at 
75mg/ml was added, and the solution was allowed to react with stirring for two 
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hours at room temperature The reaction mixture was then diluted with 11 volumes 
of lOOmM Na acetate pH4.0, 6M guanidinium chloride, and loaded onto a 
preparative reverse-phase (C4) HPLC column and pumped at 20% Buffer C until all 
non-peptide components had eluted, then the ligation product was purified with a 
5 step gradient to 55% Buffer C. Fractions containing the desired modified product 
(Cys 33 (Acm); Y 89 ' 117 ){SEP-1:2+SEP-1:3+SEP-1:4+GP41} were identified by ES- 
MS and pooled and lyophilized. 

Step 5: Picolyl Removal Zinc dust ( 71 milligrams per milligram of 
peptide) was activated in 2M HC1 for 5 minutes, the activation was repeated once, 

10 then the zinc was washed for 10 minutes with 6M guanidinium chloride, 50mM 
glycine pH2.2 containing (freshly added) 35 mg/ml L-methionine and 35 mg/ml Na 
dodecanoylsarcosine and 10%v/v TFE. to remove excess acid. The wash was 
repeated once. The Cys 161 (Pico)-containing peptide (Cys 33 (Acm); ¥ 89,117 ){SEP- 
1:2+SEP-1:3+SEP-1:4+GP41} was dissolved in neat TFE at about 30 mg/ml 

15 concentration. The solution was diluted with 4 volumes (relative to TFE) 6M 
guanidinium chloride, 50mM glycine pH2.2 containing (freshly added) 35 mg/ml L- 
methionine and 35 mg/ml Na dodecanoylsarcosine and 10%v/v TFE. The solution 
was added to the activated zinc powder and stirred for 50 minutes at room 
temperature. The reaction was monitored by analytical C4 reverse-phase HPLC of 

20 aliquots (treated with an equal volume of |3-mercaptoethanol and a few grains of 
TCEP, then diluted with 2 volumes of 6M guanidinium chloride, lOOmMNa acetate 
pH4.0 for analysis) at ~lhr intervals and was complete after 2 to 5 hours. After 
picolyl group removal was complete, as shown by ES-MS analysis of the analytical 
HPLC peaks (i.e. loss of 91Da mass), the supernatant was removed by filtration and 

25 the remaining Zn powder was washed 3 times with 6M guanidinium chloride, pH 4, 
100 mM acetate containing 35 mg/ml L-methionine and 35 mg/ml dodecylsarcosine 
containing 20% TFE. BioRad SM-2 beads (approximately one-third the volume of 
the solution) were added to the combined supernatant and washes and stirred at room 
temperature for 30 minutes, then filtered, p-mercaptoethanol was added to 10%v/v, 

30 then 0.25x(combined weight of peptides) of TCEP was added and the solution 
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stirred for 5 minutes at room temperature. The solution was diluted 1 : 1 with an equal 
volume of lOOmM Na acetate pH4.0, 6M guanidinium chloride and loaded onto a 
preparative (C4) reverse-phase HPLC column and 35% Buffer C pumped until all 
the non-peptide material was eluted, and the desired product was purified with a step 
5 gradient to 55% Buffer C. Fractions containing the desired Cys 161 -deprotected 
product (Cys 33 (Acm); Y S9 '" 7 ){SEP-l:2+SEP-l:3+SEP-l:4+GP41-Pico} were 
identified by electrospray mass spectrometry and pooled. 

Step 6: Acm-removal #2 The pooled solution of (Cys 33 (Acm); 
x F 89 - U7 ){SEP-l:2+SEP-l:3+SEP-l:4+GP41-Pico} [{ £ (Cys 33 (Acm) . y 

10 1712-1713-1 776-GP4 1 ] was diluted 3x with HPLC grade water, and solid urea was 
added for a final concentration of 2M. A threefold molar ratio (relative to the total 
cysteine concentration) of a 30 mg/ml Hg(acetate) 2 solution in 3% aqueous acetic 
acid was added and the solution was stirred for one hour at room temperature. The 
solution was then made 20% in p-mercaptoethanol, and was loaded onto a semi- 

15 preparative (C4) reverse-phase HPLC column. Buffer C (20%) was pumped until all 
non-peptide material had been eluted, and then the desired product was purified with 
a step gradient to 55% Buffer C. Fractions containing the desired product (Cys 33 ; 
¥ 89,U7 ){SEP-l:2+SEP-l:3+SEP-l:4+GP41-Pico} were identified by ES-MS, 
diluted 2x (v/v) with water containing 2x (w/w relative to peptide mass) DPC 

20 (dodecylphosphocholine) and were lyophilized overnight. 

Step 7: Ligation #3 (Cys 33 ; T 89 ' 117 ){SEP-1:2+SEP-1:3+SEP-1:4+GP41- 
Pico} [i.e. (Cys 33 ; Y 89 ' 117 )-1712-1713-1776-GP41] was dissolved in neat TFE (1 
volume) at 3 mM concentration. SEP-1:1 [i.e. 1711-GP41] was dissolved in 2 
volumes 300 mM Na phosphate buffer (pH 7.9) containing 6 M guanidinium 

25 chloride. The solutions were combined, and l%v/v thiophenol was added. The 
ligation mix was stirred overnight at room temperature. To the solution was added 3 
volumes (relative to TFE volume above) of p-mercaptoethanol, and 9 volumes of 
ligation buffer (300 mM Na phosphate buffer pH 7.9, containing 6 M guanidinium 
chloride). To the solution was added 0.25x(combined weight of peptides) TCEP and 

30 the solution was stirred at room temperature for 20 minutes. Glacial acetic acid (0.6 
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volumes) was added to acidify the solution to pH4.0, and the solution was then 
diluted with 30 volumes of lOOmM Na acetate pH4.0 containing 6M guanidinium 
chloride, and loaded onto a preparative (C4) reverse-phase HPLC column. Buffer C 
(25%) was pumped until all non-peptide material was eluted, then the ligation 
5 product was purified with a linear gradient of Buffer C from 35-55% over 80 
minutes.. Fractions containing the desired ligated product SEP-1 (1-166) 
(T 89 ' 117 ){SEP-1:1(+GP41)+SEP-1:2+SEP-1:3+SEP-1:4+GP41}: (SEQ ID NO:2) 
were identified by electrospray mass spectrometry, and pooled. 

Step 8 Folding: To the combined fractions containing full-length ligated 

10 peptide SEP-1 (1-166) OP 89 '" 7 ){SEP-1:1(+GP41)+SEP-1:2+SEP-1:3+SEP- 
1:4+GP41} [i.e. 1711(GP41)-1712-1713-1776-GP41] was added solid guanidinium 
chloride, 1M Tris buffer (pH 8.7) and distilled water to make the final solution 
0.1 milligrams per ml ligated peptide SEP-1 (1-166), 6M guanidinium chloride, 
lOOmM Tris, This ligated peptide ('protein') solution was loaded into 15ml dialysis 

15 cassettes and dialyzed overnight at 4 "C against a solution of 100 mM Tris buffer 
(pH 8.5) containing 3 M guanidinium chloride, 5micromolar cysteine and 2 
micromolar cystine. The protein solution was then dialyzed against a solution of 100 
mM Tris buffer (pH 8.5) containing 1 M guanidinium chloride for 8 hours at 4 °C 
and Finally dialyzed against 10 mM Tris buffer (pH 7.0) for 14 hours at 4°C to yield 

20 the final folded product. The folded protein-containing solutions from the dialysis 
cassettes were combined. Folding was verified by ES-MS, analytical RP-HPLC, and 
by CD spectrometry. 

Step 9 Purification: The folded protein-containing solution was loaded on to 
Q-Sepharaose ion exchange column equilibrated at 10 mM Tris, pH 7.0. The folded 

25 protein was eluted using a linear salt gradient to 125 mM NaCl. Fractions containing 
the desired folded product SEP-1-B51 were identified by non-reducing SDS-PAGE, 
and pooled. The combined fractions were concentrated by ultrafiltration to a 
concentration of approximately 2 milligrams per milliliter, then the protein solution 
was loaded onto a 2.6x100cm S-300 gel filtration column. The purified SEP-1-B51 

30 was eluted with lOmM Tris pH7.0, 137 mM sodium chloride. Fractions containing 
high purity SEP-1-B51 were identified by non-reducing SDS-PAGE, and were 
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pooled, frozen and stored at - 80 °C. The final, purified folded protein SEP-1-B51 
was characterized by analytical (C4) reverse-phase HPLC, ES-MS, by non-reducing 
SDS-PAGE, and CD spectrometry. Figure 27 shows a representative Isoelectric 
Focusing Gel (IEF) and non-reducing SDS-PAGE gel showing the relative 
5 molecular weight of the folded, purified SEP1-B51. A molecular weight standard 
run on the same gels are shown for comparison. As shown, the relative molecular 
weight of SEP1-B51 by determined by non-reducing SDS-PAGE is approximately 
73kDa. The relative pi is approximately 5.0. Also shown is a representative RP- 
HPLC chromatogram of the folded, purified SEP-I-B51 product. This illustrates the 
10 purity and increased relative molecular weight of the precision polymer-modified 
proteins of the present invention. 

Example 8 

Synthesis Of Synthetic Eryturopoiesis Stimulating Protein SEP-1-B52 

A. Composition of SEP-1-B52. A seventh synthetic erythropoiesis stimulating 
15 protein (designated SEP-1-B52) was synthesized. The amino acid sequence of the 
full-length SEP-1-B52 is the same as that of SEP-1-B51: 

APPRLICDSR VLERYLLEAK EAEK ox ITTGCA EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPW\]/P LQLHVDKAVS GLRSLTTLLR ALGAQKVJ/AIS 

20 P P D A AK ox A A P L RTITADTFRK LFRVYSNFLR GKLKLYTGEA 

CRTGDR (SEQ ID NO: 2) 

where Y denotes an non-native amino acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group, and where K ox denotes a non- 
native lysine that is chemically modified at the e-amino group with an oxime linker 
25 group coupled to a designated water-soluble polymer through an oxime bond. 

The SEP-1-B52 protein was derivatized at residues 24 and 126 with a 
branched (Succ-TTD) 12 -Succ-Ala polymer construct similar to SEP-1-B51, but for 
this analog the polymer was attached via an oxime bond between pyruvic acid and 
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aminooxyacetic acid. Moreover, the e-amino group of lysine residues 24 and 126 
were modified to bear an aminooxyacetyl functional group instead of a levulinic acyl 
moiety, and the branched (Succ-TTD)i 2 -Succ-Ala polymer construct was made to 
bear a pedant pyruvic acyl moiety. These changes were designed to improve 
5 synthesis and handling, and to further increase stability of the oxime linkages. 

Figure 25 is a schematic illustrating the chemical synthesis of the branched 
water-soluble polymer GRFNP43 (GP43) that was joined to the SEP-1-B52 through 
oxime-forming ligation as described below. Assembly of the full-length product was 
as described in Example 2. The structure of SEP-1-B52 is shown in Figure 26. 

10 B. Synthesis of (Succ-TTD)i 2 -Succ-AlaOtBu (GRFNP39) for 

coupling to the branching template (GRFNP42). 

(Succ-TTD) 12 -Succ-AlaOtBu (GRFNP39) was synthesized on a 0.5 mmol 
scale. 0.5mmole (-0.5 grams) Sasrin acid labile, carboxylic acid-generating 
polystyrene resin (hydroxyl substitution 1.02 mmole/g;) was swelled in DMF for 15 

15 minutes and then drained. To this hydroxyl-functionalized resin was added 450 mg 
(4.5 mmole) succinic anhydride and 488 mg (4 mmole) 4-(dimcthylamino)pyridine 
dissolved in 8 ml of DMF containing 500 microliter (3.9 mmole) DIEA 
(diisopropylethylamine) and allowed to react for 30 minutes with vortex agitation, 
then drained. The coupling was repeated and excess reactants and soluble 

20 coproducts were removed by a 1 minute vortexing flow wash with DMF (~50ml), 
then drained. The HOOC-CH 2 CH 2 CO-0-resin (0.5 mmole) was activated by 
addition of 8 ml of fresh 1.0 M (8 mmole) CDI solution in DMF and allowed to react 
for 40 minutes, then drained. Excess reactants and soluble coproducts were removed 
by a 1 minute vortexing flow wash with DMF (~50ml), and drained. 4 ml (4 grams, 

25 18.2 mmole) TTD dissolved in 4 ml 0.5M (2 mmole) HOBT solution in DMF was 
added and allowed to react with vortex agitation for 30 minutes and drained. Excess 
reactants and soluble coproducts were removed by a 1 minute vortexing flow wash 
with DMF (~50ml) and drained. Succinic anhydride (450 mg , 4.5 mmole) dissolved 
in 8 ml of 0.5M (4 mmole) HOBT (N-hydroxybenzotriazole) solution containing 

30 500 microliter (3.9 mmole) DIEA was added to the resin and allowed to react with 
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vortex agitation for 15 minutes, then drained. The three steps (CDI activation; TTD 
coupling; succinic anhydride reaction) were repeated eleven times [i.e. a total of 
twelve times]. Excess reactants and soluble coproducts were removed by a 1 minute 
vortexing flow wash with DMF (~50ml), and drained. The HOOC- 
5 CH2CH2CO(TTD-succinyl) 12 -0-resin (0.5 mmole) was activated with 8ml of fresh 
1 .0 M (8 mmole) CDI solution in DMF, allowed to react for 40 minute, and drained. 
Excess reactants and soluble coproducts were removed by a 1 -minute vortexing flow 
wash with DMF (~50ml), and drained. 2.5 mmole H-AlaOtBu.HCl was dissolved in 
4.75 ml 0.5 M (2.375 mmole) HOBT in DMF containing 150 microliter (111 mg, 

10 0.825 mmole) DIE A, and allowed to react with the CDI-activated HOOC- 
CH 2 CH 2 CO(TTD-succinyl) 12 -0-resin (0.5 mmole) for 1 hour with vortex agitation, 
then drained. Excess reactants and soluble coproducts were removed by a 1 -minute 
vortexing flow wash with DMF (~50ml), and then drained. The product tertBuOOC- 
CH(CH3)-NH-OC-CH 2 CH 2 CO(TTD-succinyl)i 2 -0-resin was washed extensively 

15 with DCM, drained and then the resin was dried under vacuum to constant weight. 
Typical weight of product-resin was around 2 grams. 

The linear GRFNP39 was cleaved from the resin support according to 
standard Fmoc-chemistry procedures using 4% TFA in DCM. The precipitated 
crude product was dissolved in 50% aqueous acetonitrile containing 0.1% TFA and 

20 lyophilized. The lyophilized polymer was dissolved in a small amount of 50% 
aqueous acetonitrile containing 0.1% TFA and diluted to reduce the concentration of 
organic below 1%. The crude product was loaded onto a C4 preparative reverse- 
phase HPLC column equilibrated at T = 40°C at 3% B. Salts were eluted 
isocratically and the desired template was purified with a linear gradient of 20-35% 

25 Buffer B (acetonitrile containing 0.1% TFA) versus 0.1% aqueous TFA over 60 
minutes. Fractions containing the desired (Succ-TTD)i 2 -Succ-AlaOtBu material 
(GRFNP39) were identified by ES-MS, frozen and lyophilized. 
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C. Synthesis of a template carrying multiple amine groups for 
branching and a pendant pyruvic acyl moiety for (later) 
attachment to the protein (GRFNP42) 

The template was synthesized manually on a Boc-Leu-OCH2-Pam-resin on a 
5 0.4 mmol scale. A one-minute flow-washing step with DMF was used between 
every coupling, deprotection and activation step. The Boc group was removed by 
treatment with neat (i.e. 100%) TFA. After DMF washing, 2 mmol Fmoc-(Rink- 
linker)-OH was coupled to the resin after activation with 1.8 mmol HBTU in 3.8 ml 
DMF containing 1 ml DIEA. After removal of the Fmoc protecting group (2x3 

10 minutes 20% piperidine in DMF), 2 mmol Fmoc-Lys(MTT)-OH [MTT = 4- 
methyltrityl] was coupled to the resin using NHS-ester activation with 2 mmol DIC 
and 2 mmol NHS in DMF. After removal of the Fmoc protecting group (2 x 1 
minute 0.5% DBU in DMF), 4 mmol succinic anhydride dissolved in 8ml of 0.5M 
HOBT solution containing 2.2 mmol DIEA was coupled to the resin for 10 minutes. 

15 After this step, the resin-bound carboxyl group was activated with 8 ml of fresh 0.5 
M CDI solution in DMF. 4 ml TTD was added in 4 ml 0.5M HOBT solution and 
coupled for 30 minutes. 

2 mmol Fmoc-Lys(Fmoc)-OH was coupled to the resin using NHS-ester 
activation with 2 mmol DIC and 2 mmol NHS in DMF. After an Fmoc removal step 

20 (2x1 minute 0.5% DBU in DMF), 4 mmol Boc-Lys(Boc)-NHS is coupled in 3 ml 
DMF. 

The MTT protecting group was removed by multiple washes with 2% TFA in 
DCM. Deprotection was complete when the supernatant lost its yellow color. The 
resin was neutralized with 10% DIEA in DMF for one minute. 2 mmol pyruvic acid 

25 was coupled to the resin using NHS-ester activation with 2 mmol DIC and 2 mmol 
NHS in DMF for 45 minutes. The template was deprotected and cleaved from the 
resin support using neat TFA containing 5% water. The cleavage solution was 
evaporated to dryness in a rotator evaporator. The residue was dissolved in 50% 
aqueous acetonitrile containing 0.1% TFA and lyophilized. The lyophilized 

30 template was dissolved in a small amount of 50% aqueous acetonitrile containing 
0.1% TFA and diluted to reduce the concentration of organic below 1%. The 
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pyruvate-containing template was loaded onto a C4 Prep column equilibrated at T = 
40°C at 0 % Buffer B [i.e. 100% Buffer A = 0.1%TFA in water]. Salts were eluted 
isocratically and the desired template was purified with a linear gradient of 5-12 % 
Buffer B versus 0.1% aqueous TFA in 60 minutes. Fractions containing the desired 
5 material (GRFNP42) were identified by ESI-MS, frozen and lyophilized. 

D. Assembly and deprotection of the amide-coupled branched 
polymer (GRFNP43) 

GRFNP43, a branched (TTD-Succ) 49 - polymer of 16kDa molecular weight 
was synthesized by coupling GRFNP39 to the purified template GRFNP42. Purified 

10 (Succ-TTD) 12 -Succ-AlaOtBu (GRFNP39) (l.Ommole) dissolved in DMSO at 60°C 
to a concentration of 20 mg/ml was activated with 0.95mole of HATU in DMSO at 
a concentation of 10 mg/ml in the presence of a twenty-fold (molar) excess of DIEA. 
Purified template (0.24mole)GRFNP42 dissolved in DMSO at a concentration of 3.9 
mg/ml was added immediately. Progress of the reaction was monitored by analytical 

15 C4 reversed-phase HPLC and ES-MS. Typically, the coupling was complete within 
minutes. For work-up, 4 volumes (relative to ligation mix) 0.1M acetate / 6 M 
guanidinium chloride, pH 4 was added, and the solution was loaded onto a 
preparative (C4) reverse-phase HPLC column. Salts and other non-amide containing 
material were eluted isocratically and the desired product branched polymer was 

20 purified with a linear gradient of 20-35 % Buffer B (acetonitrile containing 0.1% 
TFA) versus 0.1% aqueous TFA over 80 minutes. Fractions containing the desired 
material were identified by ES-MS, frozen and lyophilized. 

The resulting purified branched polymer construct GRFNP43 was dissolved 
in neat TFA at a concentration of 1 mg/ml for 1 hour to remove the Ala-OtBu 

25 tertButyl ester protection. The solution was evaporated to dryness in a rotary 
evaporator and the dried polymer was dissolved in 50% Buffer B (acetonitrile 
containing 0.1% TFA). The polymer was desalted on a preparative reverse phase 
HPLC with a step gradient from 15% to 45% Buffer B versus 0.1% aqueous TFA in 
80 minutes. Pooled fractions containing the desired material (GRFNP43) were 
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frozen and Iyophilized, and the dried powder used for the oximation-forming 
ligation step. 

E. Oxime-forming ligation (oximation) of GRFN1776 and 
GRFN1711 with the branched polymer GRFNP43 

5 Segments SEP- 1:4, and Segment SEP-1:1 were synthesized as described 

above in Examples 2, 3, 4, and 7, only that instead of levulinic acid an 
aminooxyacetyl (AoA) moiety was appended to Lys 24 and Lys 126 in the respective 
peptide segments following standard coupling protocols. Thus, after assembly of the 
peptide-resins, the side chain Fmoc group was removed from each peptide-resin and 

10 2 mmol Boc-aminooxyacetic acid was added to the resin using NHS-ester activation 
with 2 mmol DIC and 2 mmol NHS in DMF. The two peptides were deprotected 
and cleaved from the resin with HF and purified by C4 reverse-phase HPLC. 
Suitable precautions were taken to avoid exposure to carbonyl compounds. Segment 
SEP-1:4 and GRFNP43 were jointly dissolved at an equimolar ratio in 50% aqueous 

15 acetonitrile containing 0.1% TFA (trifluoroacetic acid). The solution was 
Iyophilized. The dried powder was re-dissolved in 50% aqueous acetonitrile 
containing 0.1% TFA and loaded onto a preparative C4 reverse-phase HPLC 
column. The polymer-modified peptide was separated from unmodified peptide and 
unreacted polymer by preparative gradient reverse-phase HPLC. Fractions 

20 containing the desired oximated product SEP-1:4+GP43 were identified by ES-MS 
and pooled. 

Segment SEP-1:1 and GRFNP43 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA. The solution was frozen and 
Iyophilized. The dried powder was dissolved in 50% aqueous acetonitrile containing 
25 0.1% TFA and loaded onto a preparative gradient C4 reverse-phase HPLC column. 
The polymer-modified peptide was separated from unmodified peptide and 
unreacted polymer by preparative reverse-phase gradient elution. Fractions 
containing the desired oximated product SEP-1:1+GP43 were identified by ES-MS 
and pooled. 
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F. Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP- 
1-B52 

SEP-1-B52 (SEQ ID NO:2) was synthesized in solution from four 
polypeptide segments: 

5 Segment SEP:1:1+GP43 (GRFN 1711+GRFNP43; corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK EAEK° X ITTGCA EH-thioester (where 
Lys 24 has an AoA pendant moiety oxime-Iinked to the branched polymer 
GRFNP43, as denoted by K ox ; and where His 32 is Dnp protected) 

10 Segment SEP-1 :2 (GRFN 1712; corresponding to residues 33-88 of SEQ ID NO:2): 
CSLNEKIT VPDTKVN FYA WKRMEVGQQA VEVWQGLALL 
SEAVLRGQAL LVKSSQPW-thioester (where Cys 33 is Acm protected; and 
where the three Trp residues are formyl protected) 

Segment SEP-1 :3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
15 NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys 89 is 
Acm protected; and where His 94 is Dnp protected) 

Segment SEP-1 :4+GP43 (GRFN 1776+GRFNP43, corresponding to residues 117- 
166 ofSEQIDNO:2): 
20 CAIS PPDAAK ox AAPL RTITADTFRK LFRVYSNFLR 

GKLKLYTGEA CRTGDR-carboxylate (where the C-terminal cysteine (i.e. 
Cys 161 ) carries a picolyl (pico) protecting group, and where Lys q126 has an 
AoA pendant moiety oxime-linked to the branched polymer GRFNP43, as 
denoted by K ox ). 

25 Synthesis of additional peptides, ligation reactions, carboxymethylation, 

protecting group removal reactions, folding and purification are performed as 
described above in Examples 1, 2, 3, 4, and 7, to yield full-length, folded SEP-1-B52 
(SEQ ID NO:2), which was characterized by analytical (C4) reverse-phase HPLC, 
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ES-MS, and non-reducing SDS-PAGE. Bioassays are performed as described for 
the other SEP constructs. 

Example 9 
Efficacy studies for SEP-3-L42 

5 SEP-3-L42 was reformulated in Citrate Buffer (20mM sodium citrate + 

lOOmM sodium chloride) plus 0.25% rat serum albumin (RSA) and was 
administered intravenously to normal male rats (5 rats per group) at doses of 0, 1, 5, 
or 10 p.g/kg, tiw, on Days 1, 3, and 6. Blood samples were collected 4 days after the 
last injection (Day 9) and analyzed for hematologic parameters. There were no 
10 statistically significant differences in red blood cell (RBC), hemoglobin (HGB), 
hematocrit (HCT), and reticulocyte count (RET) values at 4 days after the last 
injection with SEP-3-L42 at these doses when compared with those of the control 
group. 

Example 10 

15 Pharmacokinetic studies for SEP-3-L42 

SEP-3-L42 was reformulated in Citrate Buffer (20mM sodium citrate + 
100mM sodium chloride) plus 0.25% RSA, pH6.9, and was administered 
intravenously as a single dose to normal male rats at a dose level of 5 ug/kg. Blood 
samples were collected at 0, 1, 2, 4, 6, 12, 24, 48, 72, 96, 120, 144, 168 hours after 
20 the dosing. The plasma SEP-3-L42 concentration was determined by anti-EPO 
ELISA kit (R&D Systems, Human Erythropoietin Quantikine IVD Immunoassay 
Kit #DEP00) according to the manufacturers instructions. The results are shown in 
Figure 29. 

As shown in Figure 29, SEP-1-B50 exhibited slightly slower clearance 
25 compared to SEP3-L42. Despite its circulating half-life, SEP3-L42 failed to exhibit 
statistically significant differences in promoting red blood cell production at the 
doses tested. In contrast, the SEP-1-B50 stimulated red blood cell production at the 
same doses. This illustrates that polymer structure can be exploited for fine-tuning 
in vivo biological properties, including pharmacokinetic behavior and potency. 
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Example 11 
Efficacy studies for SEP-1-L30 

SEP-1-L30 was reformulated in Citrate Buffer (20mM sodium citrate + 
1 OOmM sodium chloride) plus 0.25% RSA and was administered intravenously to 
5 normal male rats (5 rats per group) at doses of 0, 1, 5, 25, or 50 ug/kg, tiw, on Days 
1, 3, and 5. Blood samples were collected 4 and 8 days after the last injection (Days 
9 and 13) and analyzed for hematologic parameters. There were no statistically 
significant differences in RBC, HGB, and HCT values at any interval after treatment 
with SEP-1-L30, when compared with those of the control group. Reticulocyte 
10 counts were higher at 4 days after the last injection (Day 9) for rats treated with 25 
and 50 (.ig/kg (statistically significant difference when compared with those of the 
control group). No other differences in hematologic parameters were observed at 
either Day 9 or Day 13 for any of the animals treated with SEP-1-L30. [Data not 
shown] 

15 Example 12 

Pharmacokinetic studies for SEP-1-L30 

SEP-1-L30 was reformulated in Citrate Buffer (20mM sodium citrate + 
lOOmM sodium chloride) plus 0.25% RSA, pH6.9, and was administered 
intravenously to normal male rats as a single dose at a dose level of 5 or 25 ug/kg. 
20 Blood samples were collected at 0, 1, 2, 4, 6, 12, 24, 48, 72, 96, 120, 144, and 168 
hours after the dosing. Plasma concentration of SEP-1-L30 was determined by 
ELISA kit (R&D Systems, Human Erythropoietin Quantikine P/D Immunoassay 
Kit #DEP00) according to manufacturers instructions. No SEP-1-L30 was 
detectable at any of the time points at the doses given. 

25 Example 13 

Efficacy studies for SEP-1-B50 

SEP-1-B50 was reformulated in sterile PBS (phosphate buffered saline) plus 
0.25% RSA and was administered intravenously to normal male rats (5 rats per 
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group) at doses of 0, 1, 5, 25, or 125 ug/kg, tiw, on Days 1, 3, and 6. Blood samples 
were collected 4, 9, and 14 days after the last injection (Days 10, 15, and 20) and 
analyzed for hematologic parameters. The data is shown in Table VIII below. A 
dose-related response in increased RBC, HGB, HCT, and RET was observed at 4 
5 days after the last injection (Day 10) for animals receiving 5, 25, and 125 ug/kg 
SEP-1-B50 (statistically significant difference when compared with those of the 
control group). RBC, HGB, and HCT for these animals remained higher than 
control values on Day 15 and continued to be significantly higher than control values 
through Day 20 (14 days after the last injection) for animals treated at 125 ug/kg. 
10 RET production was reduced after Day 10, with significantly lower counts on Days 
15 and 20 for the 25 and 125 ug/kg animals. 

Table VIII 



Hematology Findings Following Multiple Doses of SEP-1-B50 



RBC 


Dav 10 


Day 15 


Day 20 


Control 
1 ug/kg 
5 ug/kg 
25 p.g/kg 
125 ug/kg 


7.01 +0.301 
7.07 ±0.482 
7.16 ±0.402 
7.98 ±0.484*'* ;, 
8.71 ± 0.512** 


6.86 ±0.391 
6.85 ±0.446 
7.27 ±0.229 
7!73 ± 0.448* ■:. 
8.89 + 0.598** 


6.96 ±0.178 
7.00 + 0.315 
6.92 ±0.238 
7.20 ±0.331 
8.25 ±0.641** 






HGB 


Day 10 


Day 15 


Day 20 


Control 
1 ^g/kg 
5 ug/kg 
25 ug/kg 
125 ug/kg 


14.2 ±0.49 
14.4 ±1.18 
15.6 ±0.43* 
16,1 + 0:49** 
16.9 ±0.82** 


14.0 + 0.61 

14.2 ±0.44 

15.3 ±0.33* 

15.1 + 0.57 

16.4 + 0.32** 


13.9 ±0.36 
14.1 ±0.30 
14.4 ±0.30 
13.8 ±0.22 
15.0 + 1.07* 




HCT 


Day 10 


Day 15 


Day 20 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 
125 ug/kg 


41.1 ±1.39 

42.6 ±1.82 

44.7 + 1.6.9* 
46:6±.1.50** 

49.8 + 2.13** ■ 


40.5 + 0.93 

40.6 ±1.83 
44.1 ±1.27 
43.6 ±1.61 
48.4 - 3.95** 


40.9 ±1.33 

41.2 + 1.13 
41.5 ±0.90 

40.3 ± 1.05 

44.3 + 3.39* j 
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RET 


Day 10 


Day 15 


Day 20 


Control 

1 ug/kg 
5 ug/kg 
25 ug/kg 
125 ug/kg 


2.4+1.14 
3.8+1.14 
6.6+3.6.0* 
6.2+1,12* 
8.4+1.60** 


2.5 + 1.10 
2.3 + 0.46 
1.8 ±0.48 
111 ±0.24* 
0.9 ±0.50** 


0.9 + 0.13 
1.3 ±0.41 
0.8 ±0.26 
0.2 ±0.21** 
0.3 + 0.36** 



*Significant difference from control, p<0.05 
** Significant difference from control, p<0.01 



Example 14 
Pharmacokinetic studies for SEP-1-B50 

SEP-1-B50 was reformulated in Citrate Buffer (20mM sodium citrate + 
lOOmM sodium chloride) plus 0.25% RSA, pH6.9, and was administered 
5 intravenously to normal male rats as a single dose at a dose level of 5 or 25 ug/kg. 
Blood samples were collected at 0, 1, 2, 4, 6, 12, 24, 48, 72, 96, 120, 144, 168 hours 
after dosing. Plasma concentration of SEP-1-B50 was determined by ELISA kit (R 
& D Systems, Human Erythropoietin Quantikine IVD Immunoassay Kit #DEP00) 
according to manufacturers instructions. The elimination half-lives were determined 
10 to be 9.7 and 9.9 hours for 5 and 25 ug/kg dose, respectively. The observed MRT 
(mean residence time) was 13.9 and 14.4 hours for the 5 and 25 ug/kg dose, 
respectively. A representative pharmacokinetic profile for SEP-1-B50 is shown in 
Figure 29. 

Example 15 

1 5 Efficacy studies for SEP-1-B51 

SEP-1-B51 was administered intravenously to normal male rats in two 
experiments. 

Experiment 1: SEP-1-B51 was reformulated in Citrate Buffer (20mM 
sodium citrate + lOOmM sodium chloride) plus 0.25% RSA and was administered 
20 intravenously to male rats (5 rats per group) at doses of 0, 1, 5, or 25 Jiig/kg, tiw, on 
Days 1, 3, and 6, and blood samples collected at 4, 9, and 14 days after the last 
injection (Days 10, 15, and 20) for analysis of hematologic parameters. The data is 
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shown in Table IX below. Four days after the third and last intravenous injection 
with SEP-1-B51 (Day 10), a statistically significant increase in RBC, HGB, HCT, 
and absolute reticulocyte counts (ART) was observed for animals receiving 25 ug/kg 
SEP-1-B51, when compared with the values for the control group. The RBC value 
5 for these animals remained higher than the control value on Day 15 and was 
comparable to the control value on Day 20. Reticulocyte production was reduced 
after Day 10, with significantly lower counts observed for the 5 and 25 ug/kg 
animals on Day 15. 

Table IX 

10 Hematology Findings Following Multiple Doses of SEP-1-BS1 



RBC 


i Day 10 


Day 15 


Day 20 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


5.66 ±0.497 
5.32 + 0.275 
6.22 + 0.377 
6.70 + 0.257** 


5.62 ±0.385 
5.23 + 0.470 
6.18 + 0.298 
6.25 + 0.348* 


5.90 ±0.286 
5.77 + 0.200 
6.39 ±0.369 
5.94 + 0.294 




HGB 


Day 10 


Day 15 


Day 20 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


14.3 + 1.10 
13.3 + 0.41 
15.1 ±0.53 
16.5 ±0.52** 


14.2 I 0.88 
13.2 + 0.93 
14.8 ±0.30 
15.0± 0.91 


14.8 ±0.72 

14.3 ±0.37 

15.4 ±0.25 
14.3 ±0.64 




HCT 


Day 10 


Day 15 


Day 20 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


36.8 ±3.08 
34.5 + 1.21 

39.9 ±1.51 
43.8 + 1.83** 


36.2 ±2.59 
33.4 + 2.56 
38.6 ±0.91 

39.3 + 2.77 


37.9 ±1.93 

37.1 ±0.82 

40.2 ±1.36 
37.2 + 1.60 




ART 


Day 10 


Day 15 


Day 20 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


0.29 ± 0.089 
0.34 ±0.50 
0.28 ±0.059 
0.48 + 0.121* 


0.28 ±0.086 
0.23 ±0.055 

! b!i±o;02i**' 

0.06 + 0.054** 


0.21 ± 0.034 
0.19 ±0.075 
0.20 + 0.081 
1.45 + 3.100 



** significant difference from control p<0.01 
* significant difference from control p<0.05 



Experiment 2 . SEP-1-B51 was reformulated in Citrate Buffer (20mM 
sodium citrate + lOOmM sodium chloride) plus 0.25% human serum albumin and 



WO 02/20033 



PCT7US01/21930 



- 155- 

was administered intravenously to male rats (5 rats per group) at doses of 0, 1, 5, or 
25 jig/kg, tiw, on Days 1, 3, and 5, and blood samples collected at 2, 4, and 6 days 
after the last injection (Days 7, 9, and 11) for analysis of hematologic parameters. In 
addition, blood samples for measurement of hematocrit only [as packed cell volume 
5 (PCV)] were collected daily on the remaining study days (Days 2, 3, 4, 5, 6, 8, 10, 
12, and 13). The data is shown in Table X below. HCT (PCV or calculated) values 
were significantly increased for animals receiving 5 and 25 ug/kg beginning 2 days 
after the first injection (Day 3) and continuing through 6 days after the third and last 
injection (Day 11). RBC and HGB were also significantly increased on the days 
10 that they were measured (2, 4, and 6 days after the last dose; Days 7, 9, and 11) for 
animals receiving 5 and 25 ug/kg. ART was significantly increased only for the 25 
ug/kg group at 2 and 4 days after the last dose (Days 7 and 9). 

Table X 



Hematology Findings Following Multiple Doses v of SEP-1-B51 



RBC 


Day 7 


Day 9 


Day 11 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


5.61 +0.679 
6.08 + 0.141 
6.29 + 0.459 
6.61 +0.295 ** 


5.23 + 0.467 
5.70 ±0.300 

6.07 + 0.308**' V; 

6.24 H 0.268** 


5.34 + 0.405 
5.67 ±0.547 
6.11 + 0.248* 
6.38 + 0.173** 




HGB 


Day 7 


Day 9 


Day 11 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


13.7+1.07 

14.6 ±0.27 
15.8_l0.50** 

16.7 ±0.86** '; 


12.9 + 0.79 
13.9±0.19 
14.9 _ 0.53** 
15.1 ±0:66** -j 


13.3 ±0.93 
13.9 ±0.90 
14.8 + 0.18* ' 
14.6 + 0.69* : 




HCT 


Day 7 


Day 9 


Day 11 


Control 

1 ug/kg 
5 ug/kg 
25 ug/kg 


35.6 + 3.61 
38.9 + 0.82 

41.3 + 1.92** 

44.4 +2.08** 


33.2 ±2.39 
35.9+ 0.94 

39.1+1.0:1** 

39.7 - 2.23** ;. 


34.5 + 3.19 
35.9 + 2.43 
39.2 + 0.78* 
39.0 + 2.32**7/ 
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ART 


Day 7 


Day 9 


Day 11 


Control 
1 ug/kg 
5 ug/kg 
25 ug/kg 


0.50± 0.158 
0.30 + 0.044* 
0.45 + 0.125 
0.78 + 0.117** 


0.36 ±0.093 
0.30 + 0.026 
0.36 ±0.056 
0.71 ±0.152** 


0.34 ±0.079 
0.25 ± 0.054 
0.25 ±0.045 
0.32 ±0.099 



*Significant difference from control, p<0.05 
**Significant difference from control, p<0.01 
v Note: Day 7 (2 days post 3rd dose); Day 9 (4 days post 3rd 
dose); Day 1 1 (6 days post 3rd dose) 



Additional groups of rats were treated with a single intravenous dose of SEP- 
1-B51 at 25 pg/kg, and blood samples were collected at 48 hr after injection for 
analysis of hematologic parameters. Blood samples for measurement of hematocrit 
only (as packed cell volume) were collected at 8, 24, and 72 hr and at 7 days after 
5 injection (Days 1, 2, 4, and 8). HCT, HGB, and ART values for the rats treated 
intravenously with SEP-1-B51 were higher than those of the control animals 
(statistically significantly) on Day 3 (2 days after injection), and HCT was also 
significantly increased in these animals on Day 4. 

Example 16 

10 Efficacy Studies In Polycythemic and Hypoxic Model for SEP-1-B51 

Groups of 10 normal mice each (vehicle control, recombinant glycosylated 
human erythropoietin produced in CHO-cells ("rhEPO") at 4 dose levels, and SEP- 
1-B51 at 4 dose levels: 0.32, 1, 5, 25 ug/kg/dose assuming 100 mU/ng) were 
exposed 18 hr per day to atmospheric air maintained at 506.5 mb in a simulated high 

15 altitude chamber for 20 days. The rhEPO or SEP-1-B51, formulated in Citrate 
Buffer plus 0.25% human serum albumin, was injected intravenously (IV) in a 200 
|.iL-volume on the 4 th post-hypoxic day. Two days later, each mouse was injected 
intraperitoneally (i.p.) with 0.2 uCi of 59 Fe. RBC-radioiron uptake was estimated 3 
days later by measuring the amount of radioactivity present in 500 pL of blood taken 

20 by cardiac puncture. 

The 72-hour RBC- 59 Fe uptake (% of dose) for SEP-1-B51 and rhEPO 
showed a clear dose related response in increased 59 Fe uptake with increasing dose 
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up to 5 ug/kg, above which the response plateaued (25 ug/kg). The three lowest 
dose levels were therefore used to calculate linear regression. The linear regression 
analyses of SEP-1-B51 and rhEPO are presented in Figure 30. The responses of 
SEP-1-B51 and rhEPO were essentially the same. The "potency ratio" of SEP-1- 
5 B51 to rhEPO was 1.0035 (95% confidence interval of 0.6947 to 1.4498) and the 
potency of SEP-1-B51 determined in this assay was 100 mU/ng (95% confidence 
interval of 69-145). 

SEP-1-B51 exhibits in vivo activity similar to that of rhEPO and acts in 
dose-dependent manner, including a response plateau at higher doses due to 
10 induction of a negative feedback mechanism typical for rhEPO in this model. This 
illustrates that the polymer structures attached at precise user-defined glycosylation 
sites in the SEP-1-B51 molecule mimic the in vivo activity contributed by the sugar 
chains of rhEPO. 

Example 17 

1 5 Pharmacokinetic Studies for SEP-1-B51 

Groups of normal male rats were dosed intravenously with a singe dose of 5 
ug/kg SEP-1-B51, formulated in Citrate Buffer plus 0.25% human serum albumin, 
or rhEPO (equivalent to 500 U/kg) and bled at 5, 30, 60 minutes, 2, 4, 8, and 24 
hours, and 2, 3, 4, 5, 6, and 7 days following the dose. The SEP-1-B51 or rhEPO 

20 concentrations in the plasma samples were determined by ELISA assays using a 
R&D Systems anti-EPO ELISA kit (R&D Systems, Human Erythropoietin 
Quantikine IVD Immunoassay Kit #DEP00) according to manufacturers instructions. 

A least squares analysis of the logarithms of the concentrations was 
performed, yielding the pharmacokinetic parameters listed in Table XI below. The 

25 change in plasma concentration of SEP-1-B51 with time in male rats receiving a 
single intravenous dose of 5 ug/kg could be described by a mono-exponential 
pharmacokinetic disposition function with half-life of 10.5 ± 0.5 hours. The volume 
of distribution of the central compartment (Vc) for SEP-1-B51 was 32.5 + 2.0 
mL/kg. The clearance (CL) was 2.15 mL/hr/kg, with mean residence time (MRT) of 
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15.1 ± 0.7 hours. By comparison, the change in plasma concentration of rhEPO in 
male rats receiving an intravenous dose of 5 jig/kg was best described using a bi- 
exponential pharmacokinetic disposition function, with a half-life of 1.24 ± 0.22 
hours and (3 half-life of 5.51 ± 0.40 hours. The Vc for rhEPO was 57.0 ±3.2 mL/kg. 
5 The CL was 16.0 + 0.5 mL/hr/kg, about 8 fold greater than that of SEP-1-B51. The 
mean residence time (MRT) of rhEPO was 5.73 ±0.17 hours, about 3 fold shorter 
than that of SEP-1-B51. 

Table XI 



Pharmacokinetic Parameters for SEP-1-B51 Compared to rhEPO 



Curve 


AUC 
(ng/mL-hr) 


Cmax 
(ng/mL) 


Vc 
(mL/kg) 


CL 
(mL/hr/kg) 


SEP-1-B51 


2326 ±121 


154 ±9 


32.5 ±2.0 


2.15 + 0.11 


rhEPO 


312 + 9 


87.9 ±5.0 


57.0 ±3.2 


16.0 + 0.5 




Curve 


MRT 
(Hours) 


Tl/2a 
(Hours) 


T1/2P 
(Hours) 


SEP-1-B51 


15.1+0.7 


10.5 + 0.5 


N/A 


rhEPO 


5.73 ±0.17 


1.24 ±0.22 


5.51 ±0.40 



10 The graphic presentation of SEP-1-B51 and rhEPO plasma clearance is in 

Figure 31. These data illustrate a significant increase in circulating half-life for 
SEP-1-B51 over the glycosylated recombinant human EPO, and that this increase is 
due to the polymer structures attached at precise user-defined sites in the molecule. 

Example 18 

1 5 Synthesis Of Polymer-Modified Synthetic Rantes Proteins 

In order to further illustrate the polymer modified proteins of the present 
invention, two polymer-modified derivatives of RANTES 2-68 (RANTES G1755- 
01; Figure 32 and RANTES G1755; Figure 33) and two polymer-modified 
derivatives of RANTES 4-66 (RANTES G1805; Figure 34 and RANTES G1806; 
20 Figure 35) were prepared. The structures of these polymer-modified proteins are 
shown in Figures 32 - 35. 
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Preparation of RANTES Analog G1755-01 

The RANTES analog, G1755-01, depicted in Figure 32 was synthesized as 
follows. 

1. Preparation of AOP-[RANTES(2-33)]-thioester 

5 aminooxypetane-glyoxalyl-PYSSDTTPCC FAYIARPLPR 

AHIKEYFYTS GK-thioester 

The N-terminal peptide segment AOP-[RANTES(2-33)]- a thioester (thioester 
also depicted as -C(O)SR) containing an N-terminal aminooxypentane-glyoxalyl 
oxime moiety (AOP) was synthesized by assembly of the RANTES(2-33) sequence 

10 on a thioester-generating resin (Hackeng, et al., PNAS (1999) 96: 10068-10073), and 
then modified at the N-tenninus by direct coupling of an aminooxypentane-glyoxalyl 
oxime moiety [i.e. CH 3 (CH 2 ) 4 -0-N=CHCOOH] to the resin following standard 
protocols (Wilken et al., Chemistry & Biology (1999) 6(1):43-51). The peptide- 
resin was cleaved with hydrogen fluoride and purified by reverse-phase HPLC to 

15 yield the a-carboxy thioester peptide AOP-RANTES(2-33)-thioester. Observed 
mass = 4179.6 Da; Calculated mass = 4178.6 Da (average isotope composition). 

2. Preparation of [RANTES(34-68)]-Lys69(GP6)-Leu70 
CSNPAVVFVT RKNRQVCANP EKKWVREYIN SLEMSK ( GP6 ) L 

where GP6 is bound to the N s of Lys 69: 

o o 

20 a. Solid Phase Peptide and On-{Peptide-Resin} Water-soluble 

Polymer Synthesis 

The C-terminal peptide segment [RANTES(34-68)]-Lys69(Fmoc)-Leu70 
was synthesized by conventional SPPS on a Boc-Leu-OCH2-Pam-resin using the in 
situ neutralization/HBTU activation protocols for Boc chemistry SPPS as described 
25 in Schnolzer, et al, Int. J. Pep. Protein Res. (1992) 40:180-193. 
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After completion of chain assembly, the Fmoc group on the epsilon nitrogen 
of Lys69 was removed from the protected peptide-resin with 20% piperidine in DMF 
treatment. Succinic anhydride was then coupled in an HOBT solution in DMF 
containing DIEA to the epsilon nitrogen on Lys69. After washing, the free carboxyl 
5 group of the resin-bound succinic acid was activated by addition of CDI in DMF. 
After another wash cycle, 50% TTD in 0.5M HOBT in DMF was added. After 
washing, the resin-bound polymer was ready for the next cycle of succinic anhydride 
coupling/CDI activation/TTD coupling. After 6 cycles, succinic anhydride was 
coupled to the last TTD amine in HOBT/DMF plus DIEA solution. 
10 The peptide/linear water-soluble polymer resin was cleaved from the resin 

with hydrogen fluoride and the product purified by reverse-phase preparative HPLC 
to yield [RANTES(34-68)]-Lys69(GP6)-Leu70. Observed mass = 6253 Da; 
Calculated mass = 6252 Da (average isotope composition). 



AHIKEYFYTS GKCSNPAVVF VTRKNRQVCA NPEKKWVREY 
INSLEMSK(GP6)L (SEQIDNO.:4) 

The N-terminal and C-terminal peptide segments described above were 
joined together by native chemical ligation as described Dawson et al, Science 

20 266:776-779 (1994). The full-length polypeptide product in reduced form was 
purified using reverse-phase HPLC with a linear gradient of acetonitrile versus water 
containing 0.1% trifluoroacetic acid. Observed mass = 10,060 Da. 

The full-length polypeptide with GP6 attached was folded with concomitant 
formation of 2 disulfide bonds in aqueous buffer containing a cysteine-cystine redox 

25 couple, and purified by reverse-phase HPLC following standard protocols (Wilken et 
al., Chemistry & Biology (1999) 6(1):43-51). The folded product was homogeneous 
on HPLC. Observed mass = 10,056 Da; Calculated mass = 10,058 Da (average 
isotope composition). 



3. 



Preparation of G1755-01 

aminooxypentane-glyoxalyl-PYSSDTTPCC 



15 



FAYIARPLPR 
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Example 19 
Preparation of RANTES Analog G1755 

The G1755 compound depicted in Figure 33 was synthesized as follows. 

1. Preparation of n-nonanoyl-RANTES(2-33) 

5 n-nonanoyl-PYSSDTTPCC FAYIARPLPR AHIKEYFYTS GK- 

thioester 

n-nonanoyl-RANTES(2-33) was synthesized on a thioester-producing resin 
as described above in Example 18, and then modified at the N-terminus by direct 
coupling of n-nonanoic acid to the resin. The peptide resin was cleaved with 
10 hydrogen fluoride and purified by reverse-phase HPLC to yield n-nonanoyl- 
RANTES(2-33). Observed mass = 4177.0 Da; Calculated mass = 4177.6 Da 
(average isotope composition). 

2. Preparation of RANTES(34-68)-Lys69(GP6)-Leu70 
CSNPAVVFVT RKNRQVCANP EKKWVREYIN SLEMSK { GP6 ) L 

15 a. Solid Phase Peptide and On-{Peptide~Resin} Water-soluble 

Polymer Synthesis 

RANTES(34-68)-Lys69(Fmoc)-Leu70 was synthesized by conventional solid 
phase peptide synthesis on a Boc-Leu-OCH2-Pam-resin using the in situ 
neutralization/HBTU activation protocols for Boc chemistry solid phase peptide 

20 synthesis as described above in Example 18. 

After completion of chain assembly the Fmoc group on the epsilon nitrogen 
of Lys69 was removed from the protected peptide-resin and the GP6 construct was 
synthesized on the peptide resin as described above in Example 18. The 
peptide/linear water soluble polymer resin was cleaved with hydrogen fluoride and 

25 the product purified by reverse-phase preparative HPLC to yield RANTES(34-68)- 
Lys69(GP6)-Leu70. Observed mass = 6252.9 Da; Calculated mass = 6252.2 Da 
(average isotope composition). 

3. Preparation of G1755 
nonanoyl-PYSSDTTPCC FAYIARPLPR AHIKEYFYTS 
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GKCSNPAVVF VTRKNRQVCA NPEKKWVREY INSLEMSK (GP6) L 
(SEQ IDNO.:5) 

a. Assembly of FuII-Length Polypeptide By Native Chemical 
Ligation 

The N-terminal and C-terminal peptide segments were joined together by 
native chemical ligation as described above in Example 18. The full-length 
polypeptide in reduced form was purified using reverse-phase HPLC with a linear 
gradient of acetonitrile versus water containing 0.1% trifluoroacetic acid. Observed 
mass = 10060.7 Da; Calculated mass = 10058.7 Da (average isotope composition). 

b. Folding and Purification of Polypeptide Modified with Water- 
soluble Polymer 

The full-length polypeptide with GP6 attached was folded with concomitant 
formation of 2 disulfide bonds in aqueous buffer containing a cysteine-cystine redox 
couple, and purified by reverse-phase HPLC as described above in Example 18. 
The folded product was homogeneous on HPLC. Observed mass = 10057.4 Da; 
Calculated mass = 10054.7 Da (average isotope composition). 

Example 20 
Preparation of RANTES Analog G1805 

The G1805 compound depicted in Figure 34 was synthesized as follows. 

20 1. Preparation of n-nonanoyl-RANTES(2-33)(Tyr3Chg)-thioester 

nonanoyl-PXSSDTTPCC FAYIARPLPR AHIKEYFYTS GK- 
thioester 

where X= L-Cyclohexylglycine (Chg) 

n-nonanoyl-RANTES(2-33)(Tyr3Chg) was synthesized on a thioester- 
25 producing resin as described above in Example 18, and then modified at the N- 
terminus by direct coupling of n-nonanoic acid to the resin. The peptide resin was 
cleaved with hydrogen fluoride and purified by reverse-phase HPLC to yield n- 
nonanoyl-RANTES(2-33)(Tyr3Chg). Observed mass = 41512.0 Da; Calculated 
mass = 4152.6 Da (average isotope composition). 



10 
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2. Preparation of RANTES(34-68)(Met67Lys(Lev))-Lys69-Leu70 

CSNPAVVFVT RKNRQVCANP EKKWVREYIN SLEK (Lev) SKL 

RANTES(34-68)(Met67Lys)-Lys69(Fmoc)-Leu70 was synthesized by 
conventional SPPS on a Boc-Leu-OCH2-Pam-resin using the in situ 
5 neutralization/HBTU activation protocols for Boc chemistry SPPS as described 
above in Example 18. 

After completion of chain assembly the Fmoc group on the epsilon nitrogen 
of Lys67 was removed from the protected peptide-resin with 20% piperidine in DMF 
treatment. Levulinic acid was activiated as the symmetrical anhydride with 1,3-di- 
10 isopropylcarbodi-imide and coupled to the epsilon nitrogen of Lys67. The peptide- 
resin was cleaved with hydrogen fluoride and the product purified by reverse-phase 
preparative HPLC to yield RANTES(34-68)(Met67Lys(Lev))-Lys69-Leu70. 
Observed mass = 4433.2 Da; Calculated mass = 4434.2 Da (average isotope 
composition). 
15 3. Preparation of G1805 

nonanoyl-PXSSDTTPCC FAYIARPLPR AHIKEYFYTS 

GKCSNPAVVF VTRKNRQVCA NPEKKWVREY 

INSLEK (Lev-GP2 9 ) SKL (SEQ ID NO.-.6) 

where X= L-Cyclohexylglycine (Chg), and where GP29 = branched oxime- 
20 linked water-soluble polymer construct depicted in Figure 34 and prepared as 
described in Example 4. 

a. Assembly of Full-Length Polypeptide By Native Chemical 
Ligation 

The N-terminal and C-terminal peptide segments were joined together by 
25 native chemical ligation as described above in Example 18. The full-length 
polypeptide in reduced form was purified using reverse-phase HPLC with a linear 
gradient of acetonitrile versus water containing 0.1% trifluoroacetic acid. Observed 
mass = 8213.6 Da; Calculated mass = 8216.6 Da (average isotope composition). 
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b. Attachment of GP29 to the of Ligated, Full-Length Polypeptide 

The lyophilized full-length polypeptide was dissolved and co-lyophilized 
with an equimolar amount of the aminooxyacetyl (AoA)-containing branched water- 
soluble polymer construct GP29. The dried powder was dissolved and purified by 
5 reverse-phase HPLC to yield the full-length polypeptide with GP29 covalently 
attached by an oxime bond to die keto functionality on the modified side chain of the 
lysine at position 67. Observed mass = 23,836 Da; Calculated mass = 23,845 Da 
(average isotope composition). Alternatively, GP29 was covalently attached after 
folding the polypeptide, however this was observed to give a lower yield. 

c. Folding and Purification of Polypeptide containing Oxime- 
LinkedGP29 

The full-length polypeptide with GP29 attached was folded with concomitant 
formation of 2 disulfide bonds in aqueous buffer containing a cysteine-cystine redox 
couple, and purified by reverse-phase HPLC as described above in Example 18. The 
folded product was homogeneous on HPLC. Observed mass = 23,832 Da; 
Calculated mass = 23,840 Da (average isotope composition). 

Example 21 
Preparation of RANTES Analog G1806 

The RANTES analog, G1806, depicted in Figure 35, was synthesized as 
20 follows. 

1. Preparation of n-nonanoyl-RANTES(2-33)(Tyr3Chg)-thioester 

nonanoyl-PXSSDTTPCC FAYIARPLPR AHIKEYFYTS 
GK-thioester 

where X= L-Cyclohexylglycine (Chg) 

25 Peptide segment n-nonanoyl-RANTES(2-33)(Tyr3Chg)-thioester (thioester 

also depicted as -COSR, where R is alkyl group) was synthesized on a thioester 
producing resin as described above in Example 18, and then modified at the N- 
terminus by direct coupling of n-nonanoic acid to the resin. The peptide resin was 
cleaved with hydrogen fluoride and purified by reverse-phase HPLC to yield n- 



10 



15 



WO 02/20033 



-165- 



PCT7US01/21930 



nonanoyl-RANTES(2-33) Tyr3Chg. Observed mass = 4152.0 Da; Calculated mass 
= 4152.6 Da (average isotope composition). 

2. Preparation of RANTES(34-68)(Met67Lys(Lev))-Lys69(Palm)- 
Leu70 

5 CSNPAVVFVT RKNRQVCANP EKKWVREYIN 

SLEK (Lev) SK (Palm) L 

where K(lev) is a levulinic acid-modified epsilon nitrogen of Lys67 of the 
Met67Lys change from wild type RANTES; and where K(Palm) is a palmitate- 
modified epsilon nitrogen of Lys69 attached through an amino-octanoic moiety, 
1 0 having the structure: 

[sNitrogenLys69]-C(0)-CH2-CH2-CH2-CH2-CH 2 -CH 2 -CH2-NH-C(0)-(CH 2 )i4-CH3 

RANTES(34-68)(Met67Lys)-Lys69(Palm)-Leu70 was synthesized by 
conventional SPPS on a Boc-Leu-OCH2-Pam-resin using the in situ 
neutralization/HBTU activation protocols for Boc chemistry solid phase peptide 

15 synthesis as described above in Example 18. After coupling Boc-Lys(Fmoc)-OH at 
position 69, the Fmoc group was removed with 20% piperidine in DMF. Fmoc-8- 
amino-octanoic acid was activated with HBTU/DIEA and coupled to the epsilon 
nitrogen of Lys69. After Fmoc group removal, palmitic acid was activated with 1- 
hydroxy-7-azabenzotriazole (HATU) and 1,3-di-isopropylcarbodi-imide and coupled 

20 to the resin. The chain assembly was then completed using standard Boc chemistry 
SPPS procedures as described in Example 18. After completion of chain assembly 
the Fmoc group on the epsilon nitrogen of Lys67 was removed with 20% piperidine 
in DMF treatment. Levulinic acid was activated as the symmetrical anhydride with 
1,3-di-isopropylcarbodi-imide and coupled to the epsilon nitrogen of Lys67. The 

25 peptide resin was cleaved with hydrogen fluoride and the product purified by 
reverse-phase preparative HPLC to yield RANTES(34-68)(Met67Lys(Lev))- 
Lys69(Palmitate)-Leu70. Observed mass = 4813.7 Da; Calculated mass = 4813.8 
Da (average isotope composition). 
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3. Preparation of G1806 

nonanoyl-PXSSDTTPCC FAYIARPLPR AHIKEYFYTS 

GKCSNPAVVF VTRKNRQVCA NPEKKWVREY INSLEK (Lev- 
GP29) SK(Palm)L (SEQIDNO.:7) 

5 where X= L-Cyclohexylglycine (Chg), and where palmitate ("Palm") fatty 

acid attachment is through an aminooctanoic moiety: 

[sNitrogen Lys69]-C(0)-CH2-CH2-CH2-CH2-CH2-CH2-CH2-NH-C(0)-(CH 2 ) 1 4-CH3 

a. Assembly of Fatty Acid Modified Full Length Polypeptide By 
Native Chemical Ligation 

10 The N-terminal and C-terminal peptide segments were joined together by 

native chemical ligation as described in Example 18. The full-length polypeptide in 
reduced form was purified using reverse-phase HPLC with a linear gradient of 
acetonitrile versus water containing 0.1% trfluoroacetic acid. Observed mass = 
8598.2 Da; Calculated mass = 8596.3 Da (average isotope composition). 

b. Attachment of the Water-soluble Polymer GP29 to Ligated, Fatty 
Acid Modified Full Length Polypeptide 

The lyophilized full length polypeptide was dissolved and co-lyophilized 
with an equimolar amount of the AoA-containing polymer GP29. The dried powder 
was dissolved and purified by reverse-phase HPLC to yield the full length 
polypeptide with GP29 covalently attached by an oxime bond to the keto 
functionality on the modified side chain of the lysine at position 67. Observed mass 
= 24,217 Da; Calculated mass = 24,224. Da (average isotope composition). 
Alternatively, GP29 can be covalently attached after folding the polypeptide, 
however this was observed to give a lower yield. 

25 c. Folding and Purification of Polypeptide containing Oxime-linked 

GP29 and Fatty Acid 

The full length polypeptide with GP29 attached was folded with concomitant 
formation of 2 disulfide bonds in aqueous buffer containing a cysteine-cystine redox 
couple, and purified by reverse-phase HPLC as described above in Example 18. 
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The folded product was homogeneous on HPLC. Observed mass = 24,210 Da; 
Calculated mass = 24,220 Da (average isotope composition). 

Figure 36 shows analytical data representative of the final folded, purified 
synthetic Rantes analogs. In particular, a representative SDS-PAGE gel comparing 
5 the relative molecular weights of wild type (Wt) RANTES to synthetic chemokine 
analog RANTES G1806 under reducing (R) and non-reducing conditions (N) in 
shown in Figure 36. The relative molecular weights are depicted on the left hand 
side of each gel, which corresponds to a molecular weight standard run on the same 
gel (not shown). Also shown is a representative RP-HPLC chromatogram of the 
10 folded, purified G1806 product. This illustrates the purity and increased relative 
molecular weight of the precision polymer-modified constructs compared to wild 
type, native RANTES. 

Example 22 

Cell Fusion Assays for G1755-01, G1755, G1805 and G1806 

15 Cell fusion assays were performed to examine the anti-HIV activity of the 

various polymer-modified RANTES analogs. The ability of a given panel of 
compounds to inhibit CCR5-dependent cell fusion was determined using cells 
engineered to contain viral envelop proteins fusing with cells bearing CD4 and 
CCR5 and containing a reporter system. CCR5-mediated viral envelope-mediated 

20 cell fusion assays were carried out essentially as described in Simmons et al. 
(Science (1997) 276:276-279) using the cell lines HeLa-P5L and HeLa-Env-ADA, 
both of which were kindly provided by the laboratory of M. Alizon (Paris). Briefly, 
HeLa-P5L cells were seeded in 96-well plates (10 4 cells per well in 100 uf). Twenty- 
four hours later medium was removed and medium containing 1 0 4 HeLa-Env-ADA 

25 cells per well plus chemokines was added (200 ul final volume). After a further 
twenty-four hours, cells were washed once in PBS and lysed in 50 jul PBS/0.5 % NP- 
40 for 15 min at room temperature. Lysates were assayed for for p-galactosidase 
activity by the addition of 50 ju.1 2X CPRG substrate (16 mM chlorophenol red-|3-D- 
galactopyranoside; 120 mM Na 2 HP0 4 , 80 mM NaH 2 P0 4 , 20 mM KC1, 20 mM 

30 MgS04, and 10 mM (3-mercaptoethanol) followed by incubation for 1-2 hours in the 
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dark at room temperature. The absorbance at 575 nm was then read on a Labsystems 
microplate reader. From these values, percentage inhibition [100 x (OD( tes t) - 

OD(„egative control)) / OD( pos i t i V e control) - OD^egative control))] Was Calculated at each 

inhibitor concentration. A plot of percentage fusion inhibition against inhibitor 
5 concentration allowed the calculation of IC 50 values for each compound. 
Representative results are depicted in Table XII below. 



Table XII 

HeLa P4-CCR5 / HeLa-Env-ADA Cell Fusion Data 



Compound 


EC50 


AOP-RANTES (2-68) 


1.81 xl0" y M 


NNY-RANTES (2-68) 


3.23 x 10- ,0 M 


RANTES G1755-01 


11.00xlO' 9 M 


RANTES G1755 


1.87xl0" 9 M 


RANTES G1805 


1.40xlO" 9 M 


RANTES G 1806 


2.98xlO- 10 M 



These results demonstrate the substantial retention of anti-fusion activity 
following site-specific modification with either the linear or branched water-soluble 
polymer constructs. This result was surprising given the observed increase in 

10 activity of various RANTES analogs when a hydrophobic moiety, such as a fatty 
acid, was attached to the C-terminus, which supported the hypothesis that increasing 
the hydrophobic nature of the RANTES analogs in general would increase anti- 
fusion activity. In contrast, the water-soluble polymers used for modification, 
particularly the branched water-soluble polymers, were quite hydrophilic and 

15 negatively charged, yet only a marginal reduction in activity was observed that was 
well within the target potency range sought for in vivo activity. Even more 
unexpected was the relative retention in activity of the RANTES analogs modified 
with the large, negatively charged branched water-soluble polymer constructs 
compared to the smaller, linear water-soluble polymer constructs, and that the small 

20 linear and large branched polymer modified analogs had substantially the same 
effect on activity. This later observation is believed to be due in part to anti- 
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aggregation properties when water-soluble polymer modifications are made internal 
to the pendant C-terminal residue of wild type RANTES at an aggregation region, as 
designed, and/or the localized anti-fusion properties of the water-soluble polymers. 
The additional potency-increasing changes compensated for the marginal 
5 activity loss from attachment of water-soluble polymers, relative to AOP-RANTES 
and NNY-RANTES. These changes included the introduction of one or more 
hydrophobic amino acid derivatives to the N-terminus, and addition of a 
hydrophobic fatty acid moiety to the C-terminus. These results indicate that 
additional changes in combination with the monodisperse polyamide ethylene 
10 polymers (or other water-soluble polymers) can improve overall potency of NNY- 
and AOP-type modified chemokine molecules, such as AOP- or NNY-RANTES 
having the combined Pro2Thz and Tyr3Chg change to the N-terminus and/or the 
addition of a lipid moiety to the C-terminus. 

Example 23 

1 5 Pharmacokinetic Studies for G1755, G1805, and G1806 

Pharmacokinetic studies of polymer-modified RANTES analogs were 
performed in male Sprague-Dawley rats as follows. RANTES analogs G1755, 
G1805, and G1806, prepared as described above (along with AOP-RANTES as 
control), were formulated for intravenous (TV) injection just prior to injection from 

20 lyophilized protein stock in phosphate buffered saline at pH 7.0. Formulations were 
prepared so as to provide equimolar doses of the RANTES analogs to individual test 
animals [400ug/kg (AOP-RANTES); 510 ug/kg (G1755); 1210ug/kg (G1805); and 
1225 ug/kg (G1806) (ug analog / kg animal)]. Final concentrations were adjusted 
where necessary to provide a total dose volume of 1 ml/kg for each animal (i.e., dose 

25 volumes were held constant). Each formulated analog was administered 
intravenously via the tail of the rats on day 1 of the experiment, and each animal 
received a single dose. 

Following injection, approximately 0.25 ml of blood were collected from the 
tail vein/artery over the course of the experiment at each sample collection time 

30 point, and plasma or serum samples were prepared using EDTA as anticoagulant 
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following standard protocols and the National Institutes of Health animal care 
guidelines and recommended testing procedures. Sample collection times were 
adjusted based on initial data collection to avoid collection of more than 14 samples 
of 0.25 ml from any animal within 3 weeks (as established by the National Institutes 
5 of Health animal care guidelines). Additional samples were collected at weekly 
intervals where detectable blood levels persisted beyond the initial samples. The 
plasma concentration of each RANTES analog at each time point was determined 
using Quantikine®Elisa, Human RANTES Immunoassay, (R&D Systems 
Catalog#DRN00), per the manufacturers instructions. Overall health of the animals 

10 was monitored over the course of the experiment, including weight, eating habits, 
disposition, appearance etc., and no apparent side effects were observed. 
Representative results are depicted in Figure 37. 

These results demonstrate the substantive increase in circulating half-life 
provided by attachment of water-soluble polymers to the various precursor RANTES 

15 analogs, with the apparent increase in half-life of G1805 > G1806 > G1755 > AOP- 
RANTES. Relative to AOP-RANTES, the increase was about 40-fold for G1805, 
20-fold for G1806, and 10-fold for G1755. Collectively, the balance of retained high 
potency and significantly improved circulating half-life for the water-soluble 
polymer modified RANTES analogs is expected to enhance the therapeutic efficacy 

20 of these compounds in an in vivo setting, and reduce the number and amount of 
doses needed for treatment. 

While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further modifications 
and this application is intended to cover any variations, uses, or adaptations of the 
25 invention following, in general, the principles of the invention and including such 
departures from the present disclosure as come within known or customary practice 
within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth. 
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SEQUENCE LISTING 

<110> Gryphon Sciences 

<120> Polyme- Modified Synthetic Proteins 
<130> 03504.269 



<160> 7 

20 <170> Patentln Ver . 2.1 



<210> 1 

<211> 166 

<212> PRT 

25 <213> Homo sapiens 

<400> 1 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

30 

Leu Glu Ala Lys Glu Ala Glu Lys lie Thr Thr Gly Cys Ala Glu His 
20 25 30 

Cys Ser Leu Asn Glu Lys He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

40 Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 

65 70 75 80 

Leu Val Lys Ser Ser Gin Pro Trp Cys Pro Leu Gin Leu His Val Asp 

85 90 95 

45 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 



Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

55 Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 
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<210> 2 
<211> 166 
<212> PRT 
^ <213> Homo sapiens 

<400> 2 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

10 Leu Glu Ala Lys Glu Ala Glu Lys He Thr Thr Gly Cys Ala Glu His 
20 25 30 

Cys Ser Leu Asn Glu Lys He Thr Val Pro Asp Thr Lys Val Asn Phe 



15 



55 



Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 



Leu Val Lys Ser Ser Gin Pro Trp Cys Pro Leu Gin Leu His Val Asp 

85 90 95 

25 Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Cys Ala He Ser Pro Pro Asp Ala Ala Lys Ala Ala 
115 120 125 



Cys Arg Thr Gly Asp Arg 

165 

40 

<210> 3 

<211> 166 

<212> pr:- 

<213> Homo sapiens 

45 

<400> 3 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

50 Leu Glu Ala Lys Glu Ala Glu Cys He Thr Thr Gly Cys Ala Glu His 
20 25 30 

Cys Ser Leu Asn Glu Cys He Thr Val Pro Asp Thr Lys Val Asn Phe 



Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 
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Leu Ala Cys Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu His Val Asp 



10 Pro Leu Arg Thr lie Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 



20 <210> 4 
<211> 69 
<212> PRT 

<213> Homo sapiens 
25 <400> 4 

Pro Tyr Ser Ser Asp Thr Thr Pro Cys Cys Phe Ala Tyr lie Ala Arg 
15 10 15 

Pro Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly Lys 
30 20 25 30 

Cys Ser Asn Pro Ala Val Val Phe Val Thr Arg Lys Asn Arg Gin Val 
35 40 45 

35 Cys Ala Asn Pro Glu Lys Lys Trp Val Arg Glu Tyr lie Asn Ser Leu 
50 55 60 

Glu Met Ser Lys Leu 
65 

40 

<210> 5 
<211> 69 
<212> PRT 
45 <213> Homo sapxens 

<400> 5 

Pro Tyr Ser Ser Asp Thr Thr Pro Cys Cys Phe Ala Tyr He Ala Arg 
1 5 10 15 

50 

Pro Leu Pro Arg Ala His He Lys Glu Tyr Phe Tyr Thr Ser Gly Lys 
20 25 30 

Cys Ser Asn Pro Ala Val Val Phe Val Thr Arg Lys Asn Arg Gin Val 
55 35 40 45 

Cys Ala Asn Pro Glu Lys Lys Trp Val Arg Glu Tyr He Asn Ser Leu 
50 55 60 

60 Glu Met Ser Lys Leu 
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<210> 6 
<211> 69 
5 <212> PRT 

<213> Homo sapiens 



10 



Pro Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly Lys 

20 25 30 

15 Cys Ser Asn Pro Ala Val Val Phe Val Thr Arg Lys Asn Arg Gin Val 

35 40 45 

Cys Ala Asn Pro Glu Lys Lys Trp Val Arg Glu Tyr lie Asn Ser Leu 



20 



Glu Lys Ser Lys Leu 
65 



25 <210> 7 
<211> 69 
<212> PRT 
<213> Homo sapiens 

30 <400> 7 

Pro Xaa Ser Ser Asp Thr Thr Pro Cys Cys Phe Ala Tyr lie Ala Arg 
15 10 15 

Pro Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly Lys 
35 20 25 30 

Cys Ser Asn Pro Ala Val Val Phe Val Thr Arg Lys Asn Arg Gin Val 
35 40 45 

40 Cys Ala Asn Pro Glu Lys Lys Trp Val Arg Glu Tyr He Asn Ser Leu 
50 55 60 

Glu Lys Ser Lys Leu 
65 

45 
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What Is Claimed Is : 

1. An isolated synthetic bioactive protein having the formula Protein-Ura-B- 
Polymer-J*, wherein Protein comprises a polypeptide chain of a ribosomally 
specified protein, said polypeptide chain comprising one or more non- 
5 overlapping peptide segments covalently bonded by one or more chemical 

ligation sites, U is a residue of a unique functional group covalently bonded 
to a mutually reactive unique functional group of a side chain n of one or 
more amino acids of one or more of said non-overlapping peptide segments, 
where n is a discrete integer selected from 1 to 6, B is a branching core 
10 having three or more arms that may be the same or different and may be 

present or absent, Polymer is a substantially non-antigenic water-soluble 
polymer, and J* is a pendant group having a net charge under physiological 
conditions selected from the group consisting of negative, neutral and 
positive. 

15 2. The synthetic bioactive protein of claim 1, wherein said synthetic bioactive 
protein comprises a monomer molecular weight of greater than 25,000 
Daltons. 

3. The synthetic bioactive protein of claim 1, wherein said synthetic bioactive 
protein possesses a bioactivity that mimics a bioactivity associated with said 

20 ribosomally specified protein. 

4. The synthetic bioactive protein of claim 1, wherein said ribosomally 
specified protein is a mammalian protein. 

5. The synthetic bioactive protein of claim 4, wherein said mammalian protein 
is a cytokine, 

25 6. The synthetic bioactive protein of claim 4, wherein said cytokine is selected 
from the group consisting of an interleukin, an erythropoiesis stimulating 
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protein, an interferon, a lymphokine, a chemokine, a growth factor, a colony 
stimulating factor, and a signal peptide hormone. 

7. The synthetic bioactive protein of claim 4, wherein said mammalian protein 
is a human protein. 

5 8. The synthetic bioactive protein of claim 1, wherein said ribosomally 
specified protein comprises a ribosomally specified glycosylation site. 

9. The synthetic bioactive protein of claim 8, wherein U is covalently bonded to 
said side chain n at a site of said polypeptide chain corresponding to a 
ribosomally specified glycosylation site. 

10 10. The synthetic bioactive protein of claim 9, wherein U is covalently bonded to 
said side chain n exclusively at a site corresponding to said ribosomally 
specified glycosylation site. 

11. The synthetic bioactive protein according to claim 1, wherein said 
ribosomally specified protein is a recombinantly produced protein. 

15 12. The synthetic bioactive protein of claim 11, wherein said recombinantly 
produced protein is a non-natural protein. 

13. The synthetic bioactive protein claim 12, wherein said non-natural protein 
has one or more non-natural ribosomally specified glycosylation sites. 

14. The synthetic bioactive protein of claim 1, wherein said polypeptide chain 
20 comprises one or more irregular amino acids. 

15. The synthetic bioactive protein of claim 14, wherein said irregular amino 
acid has a side other than one of the 20 genetically encoded amino acids. 



25 



16. 



The synthetic bioactive protein of claim 15, wherein said irregular amino 
acid is selected from the group consisting of a pseudo amino acid and a 
hydrophobic amino acid derivative. 
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17. The synthetic bioactive protein of claim 1, wherein said chemical ligation 
site comprises a bond selected from the group consisting of amide, oxime, 
hydrazone, thioester, thaizolidine, oxazolidine and thioether. 

18. The synthetic bioactive protein of claim 1, wherein said side chain n is a non- 
5 natural side chain. 

19. The synthetic bioactive protein of claim 1, wherein U is covalently bonded to 
said side chain n through a bond formed by chemical ligation. 

20. The synthetic bioactive protein of claim 19, wherein said bond formed by 
chemical ligation is selected from the group consisting of amide, oxime, 

10 hydrazone, thioester, thaizolidine, and oxazolidine. 

21. The synthetic bioactive protein of claim 1, wherein one or more of U, B, 
Polymer and J* are separated by a spacer or linker, and having the formula 
U-sl-B-s2-Polymer-s3-J*, where si, s2, and s3 are spacer or linker moieties 
that may be the same or different, and may be individually present or absent. 

15 22. The synthetic bioactive protein of claim 1, wherein U is a residue of a bond 
selected from oxime, amide, amine, urethane, ether, thioether, ester, 
thioester, hydrazide, oxazolidine, and thaizolidine. 

23. The synthetic bioactive protein of claim 1, wherein one arm of B is joined to 
U, and a second arm of B is joined to Polymer. 

20 24. The synthetic bioactive protein of claim 1, wherein B comprises four or more 
arms. 

25. The synthetic bioactive protein of claim 1, wherein at least one of said arms 
comprises a residue of bond selected from the group consisting of oxime, 
amide, amine, urethane, thioether, ester, thioester, hydrazide, oxazolidine, 
25 and thaizolidine. 
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26. The synthetic bioactive protein of claim 1, wherein B comprises a branching 
core selected from the group consisting of amino, carboxylate and mixed 
amino-c 



27. The synthetic bioactive protein of claim 26, wherein said amino branching 
core comprises lysine, said carboxylate branching core comprises glutamic or 
aspartic acid, and said mixed amino-carboxylate branching core comprises 
gamma-glutamic acid. 

28. The synthetic bioactive protein of claim 1, wherein Polymer is selected from 
the group consisting of polyalkylene oxide, polyamide alkylene oxide and 
derivatives thereof. 

29. The synthetic bioactive protein of claim 1, wherein said side chain n is at a 
chemical ligation site of said polypeptide chain. 

30. The synthetic bioactive protein of claim 1, wherein B-Polymer-J* comprises 
a molecular weight of between 1,500 and 80,000 Daltons. 

31. The synthetic bioactive protein of claim 1, wherein side chain n is 2 or more, 
and wherein one or more of B-Polymer-J* is different. 

32. The synthetic bioactive protein of claim 1, wherein B-Polymer-J* comprises 
a net charge under physiological conditions that is selected from the group 
consisting of negative, neutral and positive. 

33. The synthetic bioactive protein of claim 1, wherein B-Polymer-J* is mono- 
disperse. 

34. The synthetic bioactive protein of claim 1, wherein said synthetic bioactive 
protein is mono-disperse. 

35. The synthetic bioactive protein of claim 1, wherein Polymer is a polyamide 
of the formula -[C(0)-X-C(0)NH-Y-NH]n- or -[NH-Y-NH-C(0)-X- 
C(0)]n-, where X and Y are divalent radicals that may be the same or 
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different and may be branched or linear, and n is an integer from 2 to 100, 
and where either or both of X and Y comprises a water-soluble repeat unit 
that may be linear or branched. 

36. The synthetic bioactive protein of claim 35, wherein said X is a divalent 
radical of the formula -NH-X'- or -X'-NH-, where X' is a divalent radical 
comprising a water-soluble repeat unit that may be linear or branched. 

37. The synthetic bioactive protein of claim 35, wherein said Y is a divalent 
radical of the formula -CO-Y'- or-Y'-CO-, where Y' is a divalent radical 

38. The synthetic bioactive protein of claim 35, wherein n is a discrete integer 
from 2 to 50. 

39. The synthetic bioactive protein of claim 35, wherein said water-soluble 
repeat unit comprises and ethylene oxide repeat unit of the formula -(CH 2 - 
CH 2 -0)- or -(O-CH2-CH2)-. 

40. The synthetic bioactive protein of claim 39, wherein said ethylene oxide 
repeat unit has the formula -(CH2-CH2-O),,- or -(0-CH2-CH2) n -, where n is a 
discrete integer from 2 to 50. 

41. The synthetic bioactive protein of claim 1, wherein J is an ionizable group 
having a net charge under physiological conditions selected from the group 
consisting of negative and positive. 

42. The synthetic bioactive protein of claim 41, wherein said ionizable group is 
selected from carboxyl, amine, and hydroxyl. 

43. The synthetic bioactive protein of claim 1, wherein J is a non-ionizable group 
having a net neutral charge under physiological conditions. 

44. The synthetic bioactive protein of claim 35, having an increased circulating 
half-life in a mammal of greater compared to a corresponding synthetic 
bioactive protein that is devoid of said water-soluble polymer. 
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45. A synthetic bioactive protein comprising a polypeptide chain comprising an 
amino acid sequence of a ribosomally specified glycoprotein, said 
polypeptide chain having one or more water-soluble polymers attached 
thereto and a monomer molecular weight of greater than 25 kDa. 

46. The synthetic bioactive protein of claim 45, wherein said synthetic bioactive 
protein possesses a bioactivity that mimics a bioactivity associated with said 
ribosomally specified glycoprotein. 

47. The synthetic bioactive protein of claim 46, wherein said ribosomally 
specified glycoprotein is a cytokine. 

48. The synthetic bioactive protein of claim 47, wherein said cytokine is selected 
from the group consisting of an erythropoiesis stimulating protein, Rantes, 
and granulocyte colony stimulating factor. 

49. The synthetic bioactive protein of claim 45, wherein said ribosomally 
specified glycoprotein comprises one or more glycosylation sites. 

50. The synthetic bioactive protein of claim 49, wherein said water-soluble 
polymer is attached to said polypeptide chain at one or more sites 
corresponding to said one or more glycosylation sites. 

51. The synthetic bioactive protein of claim 50, wherein said water-soluble 
polymer is attached to said polypeptide chain exclusively at one or more sites 
corresponding to said one or more glycosylation sites. 

52. The synthetic bioactive protein of claim 45, wherein said ribosomally 
specified glycoprotein is a recombinantly produced glycoprotein. 

53. The synthetic bioactive protein of claim 52, wherein said recombinantly 
produced glycoprotein is a non-natural glycoprotein. 



25 54. The synthetic bioactive protein of claim 53, wherein said non-natural 
glycoprotein has one or more non-natural glycosylation sites. 
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55. The synthetic bioactive protein of claim 45, wherein said polypeptide chain 
comprises one or more irregular amino acids. 

56. The synthetic bioactive protein of claim 45, wherein said water-soluble 
polymer comprises a molecular weight of between 1,500 and 80,000 Daltons. 

57. The synthetic bioactive protein of claim 45, wherein said water-soluble 
polymers comprises a net charge under physiological conditions that is 
selected from the group consisting of negative, neutral and positive. 

58. The synthetic bioactive protein of claim 45, wherein Polymer is selected 
from the group consisting of polyalkylene oxide, polyamide alkylene oxide 
and derivatives thereof. 

59. The synthetic bioactive protein of claim 45, wherein Polymer is a polyamide 
of the formula -[C(0)-X-C(0)NH-Y-NH]n- or -[NH-Y-NH-C(0)-X- 
C(0)]n-, where X and Y are divalent radicals that may be the same or 
different and may be branched or linear, and n is an integer from 2 to 100, 
and where either or both of X and Y comprises a water-soluble repeat unit 
that may be linear or branched. 

60. The synthetic bioactive protein of claim 59, wherein n is a discrete integer 
from 2 to 50. 

61. The synthetic bioactive protein of claim 59, wherein said water-soluble 
repeat unit comprises and ethylene oxide repeat unit of the formula -(CH 2 - 
CH 2 -0)- or -(O-CH2-CH2)-. 

62. A pharmaceutical composition comprising a synthetic bioactive protein 
according to any one of claims 1 through 61, or pharmaceutically acceptable 
salts thereof. 



25 



63. A pharmaceutical composition according to claim 62, which comprises one 
or more excipients selected from the group consisting of a buffer, a carrier 
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protein, an amino acid, a detergent, a lipid, a water soluble polymer, and a 
preservative. 

64. A pharmaceutical composition according to claim 63, which comprises one 
or more additional bioactive agents other than said synthetic bioactive 

5 protein. 

65. A method of treating a human disease or condition that comprises 
administering to an individual in need of such treatment an effective amount 
of a pharmaceutical composition comprising one or more molecularly 
homogeneous pharmaceutical compositions of a polymer-modified synthetic 

10 bioactive protein having a monomer molecular weight greater than about 25 

kDa and that possesses a bioactivity that mimics a bioactivity associated with 
a natural human protein receptor or fragment thereof, protein receptor ligand 
or fragment thereof, or a cytokine. 

66. The method of claim 65, wherein said synthetic bioactive protein has a 
1 5 bioactivity of a cytokine 

67. The method of claim 66, wherein said cytokine is selected from the group 
consisting of an interleukin, an erythropoiesis stimulating protein, an 
interferon, a lymphokine, a chemokine, a growth factor, a colony stimulating 
factor, and a signal peptide hormone. 

20 68. A pharmaceutical composition of 62 for use in treating a human disease or 
condition associated with a bioactivity or lack thereof of a natural human 
protein receptor or fragment thereof, protein receptor ligand or fragment 
thereof, or a cytokine. 

69. The pharmaceutical composition of claim 68, wherein said polymer-modified 
25 synthetic bioactive protein has a bioactivity of a cytokine. 



70. The pharmaceutical composition of claim 69, wherein said cytokine is 
selected from the group consisting of an interleukin, an erythropoiesis 
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stimulating protein, an interferon, a lymphokine, a chemoldne, a growth 
factor, a colony stimulating factor, and a signal peptide hormone. 

71 . A method of producing a polymer-modified polypeptide chain of a synthetic 
bioactive polymer-modified protein, said method comprising chemically 
ligating peptide segments comprising non-overlapping amino acid sequences 
of a polypeptide chain of said synthetic bioactive polymer-modified protein, 
wherein one or more of said peptide segments has a water-soluble polymer 
attached thereto, whereby a polymer-modified polypeptide chain of said 
synthetic bioactive polymer-modified protein is produced. 

72. The method of claim 71, wherein said polypeptide chain comprises an amino 
acid sequence of a ribosomally specified protein. 

73. The method of claim 72, wherein said ribosomally specified protein is a 
mammalian protein. 

74. The method of claim 73, wherein said mammalian protein is a cytokine. 

75. The method of claim 74, wherein said cytokine is selected from the group 
consisting of an interleukin, an erythropoiesis stimulating protein, an 
interferon, a lymphokine, a chemokine, a growth factor, a colony stimulating 
factor, and a signal peptide hormone. 

76. The method of claim 71, which further comprises folding said polymer- 
modified polypeptide chain whereby a synthetic bioactive polymer-modified 
protein is produced. 

77. The method of claim 71, wherein one or more of said peptide segments are 
partially protected. 



78. The method of claim 71, wherein one or more of said peptide segments are 
25 unprotected. 
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79. The method of claim 71, wherein one or more of said peptide segments 
comprises one or more irregular amino acids. 

80. The method of claim 71, wherein said chemically Iigating comprises a 
chemoselective ligation chemistry selected from native chemical ligation, 
extended native chemical ligation and pseudo native chemical ligation. 

81. The method of claim 71, wherein said water-soluble polymer comprises a 
molecular weight of between 1,500 and 80,000 Daltons. 

82. A method of producing a polymer-modified polypeptide chain of a synthetic 
bioactive polymer-modified protein, said method comprising: 

a. chemically Iigating peptide segments comprising non-overlapping 
amino acid sequences of said synthetic bioactive polymer-modified 
protein to form a full length polypeptide chain corresponding to the 
amino acid sequence of said synthetic bioactive polymer-modified 
protein, wherein one or more of said peptide segments comprises a 
first unique chemoselective group; and 

b. chemically Iigating a water-soluble polymer to said full length 
polypeptide chain, said water-soluble polymer comprising a second 
chemoselective group that is mutually and uniquely reactive with said 
first unique chemoselective group, whereby a polymer-modified 
polypeptide chain of said synthetic bioactive polymer-modified 
protein is produced. 

83. The method of claim 82, which further comprises folding said polymer- 
modified polypeptide chain whereby a synthetic bioactive polymer-modified 
protein is produced. 

84. A method of producing a synthetic bioactive polymer-modified protein, said 
method comprising 

a. chemically Iigating peptide segments comprising non-overlapping 
amino acid sequences of said synthetic bioactive polymer-modified 
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protein to form a full length polypeptide chain corresponding to the 
amino acid sequence of said synthetic bioactive polymer-modified 
protein, wherein one or more of said peptide segments comprises a 
non-native side chain comprising a first unique chemoselective group 
5 that is unreactive with a side chain functional group of an amino acid 

of a genetically encoded amino acid; 

b. folding said full length polypeptide chain to form a folded 
polypeptide chain; and 

c. chemically ligating a water-soluble polymer to said folded 
10 polypeptide chain, said water-soluble polymer comprising a second 

chemoselective group that is mutually and uniquely reactive with said 
first unique chemoselective group, whereby a polymer-modified 
polypeptide chain of said synthetic bioactive polymer-modified 
protein is produced. 

15 85. A molecularly homogeneous water-soluble polymer of the formula: 
U-sl-B-s2-Polymer-s3-J* 
where U is a residue of a unique functional group, B is a branching core 
having three or more arms that may be the same or different and may be 
present or absent, Polymer is a polyamide having a molecular weight greater 

20 than about 5,000 Da of the formula -[C(0)-X-C(0)-NH-Y-NH]n- or -[NH- 

Y-NH-C(0)-X-C(0)]n-, where X and Y are divalent radicals that may be the 
same or different and may be branched or linear, and n is a discrete integer 
from 2 to 50, and where either or both of X and Y comprises a substantially 
non-antigenic water-soluble repeat unit that may be linear or branched, J* is 

25 a residue of a substantially non-antigenic pendant group having a net charge 

under physiological conditions selected from the group consisting of 
negative, positive and neutral, and where si, s2, and s3 are spacer or linker 
moieties that may be the same or different, and may be individually present 
or absent. 
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86. The water-soluble polymer of claim 85, wherein U is a residue of a 
functional group selected from the group consisting of acrylate, aldehyde, 
ketone, aminooxy, amine, carboxylic acid, ester, thioester, halogen, thiol, 
cyanoacetate, dipalmitoyl phosphatidylethanolamine, distearoyl 
phosphatidylethanolamine, epoxide, hydrazide, azide, isocyanate, 
maleimide, methacrylate, nitrophenyl carbonate, orthopyridyl disulfide, 
silane, sulfhydryl, vinyl sulfones, succinimidyl glutarate, succimidyl 
succinate, succinic acid, tresylate and an activatable functional group. 

87. The water-soluble polymer of claim 85, wherein U is a residue of a 
functional group capable of forming a bond selected from the group 
carbonate, ester, urethane, orthoester, amide, amine, oxime, imide, urea, 
thiourea, thioether, thiourethane, thioester, ether, thaizolidine, hydrazone, 
and oxazolidine. 

88. The water-soluble polymer of claim 85, wherein U is protected. 

89. The water-soluble polymer of claim 85, wherein one or more of si, s2 
and s3 are present and are selected comprise from 1 to 18 carbons. 

90. The water-soluble polymer of claim 85, wherein one or more of si, s2 
and s3 are present and are selected and comprise Polymer. 

91. The water-soluble polymer of claim 85, wherein si comprise Polymer. 

92. The water-soluble polymer of claim 85, wherein B comprises four or 
more arms. 

93. The water-soluble polymer of claim 85, wherein said molecular weight is 
greater than about 10,000 Da. 



The water-soluble polymer of claim 93, wherein said molecular weight is 
greater than about 15,000 Da. 
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95. The water-soluble polymer of claim 85, wherein B comprises a branching 
core joined to s2 or Polymer through one or more amide bonds. 

96. The water-soluble polymer of claim 85, wherein B comprises a branching 
core joined to Polymer through amide bonds. 

5 97. The water-soluble polymer of claim 85, wherein said repeat unit 

comprises an ethylene oxide of the formula -(CH 2 -CH 2 -0)- or -(CH 2 - 
CH 2 -0)-. 

98. The water-soluble polymer of claim 85, wherein X, Y, or X and Y are 
selected from: -((CH 2 )ni-(CH 2 -CH 2 -0) n2 -(CH 2 ) nI -)- or -((CH 2 ) nl -(0- 

10 CH 2 -CH 2 ) n2 -(CH 2 ) n r), where nl is a discrete integer from 1 to 6, n2 is a 

discrete integer from 2 to 50. 

99. The water-soluble polymer of claim 85, wherein one of X or Y is selected 
from the group consisting of phenyl, a Ci-Cjo alkylene moiety, a C\- 
C\q alkyl group, a heteroatom-containing phenyl, a heteroatom- 

15 containing Cj-Cjo alkylene moiety, a heteroatom-containing Ci-Ciq 

alkyl group, and a combination thereof. 

1 00. The water-soluble polymer of claim 96, where X is -(CH 2 -CH 2 )-. 

101. The water-soluble polymer of claim 1 00, wherein Y is -(CH 2 -(CH 2 -CH 2 - 
0) 3 -CH 2 -CH 2 -CH 2 )n- or -(CH 2 -CH 2 -CH 2 -(0-CH 2 -CH 2 ) 3 -CH 2 )-n , where 

20 n is 6 to 36. 

102. The water-soluble polymer of claim 85, wherein J* is protected. 



103. The water-soluble polymer of claim 85, wherein J* comprises a non- 
ionizable group having a net neutral charge under physiological 
conditions. 
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104. The water-soluble polymer of claim 85, wherein J* comprises an 
ionizable group that comprises a net positive charge under physiological 
conditions. 

105. The water-soluble polymer of claim 85, wherein J* comprises an 
ionizable group that comprises a net negative charge under physiological 
conditions. 

106. The water-soluble polymer of claim 85, wherein X is -X'-NH- or 
-NH-X'-. 

107. The water-soluble polymer of claim 85, wherein Y is -Y'-NH- or 
-NH-Y'-. 

108. The water-soluble polymer of claim 107, wherein at least one of X' and 
Y' comprise a polyethylene oxide of formula (-CH2-CH 2 -0-)n or (O- 
CH 2 -CH 2 -)n where n is 2 to 50. 

109. The water-soluble polymer according to claim 85, wherein said water- 
soluble polymer is mono-disperse. 
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